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MFTHOD OF SYhTTHESIZING DTWRSE CQ LLECTTONS OF OLIGOMERS 

FIELD OF THE INVENTION 
The present invention relates generally to stochastic methods for 
5 synthesizing random oligomers, with particular emphasis on particle-based synthesis 
methods. The invention also relates to the use of identification tags on the particles 
to facilitate identification of the oligomer sequence synthesized. Yet another aspect of 
the invention relates to the use of tagged oligomer libraries in receptor-binding 
studies. 

10 

BACKGROUND OF THE INVENTION 
The relationship between structure and activity of molecules is a 
fundamental issue in the study of biological systems. Structure-activity relationships 
are important in understanding, for example, the function of enzymes, the ways in 

1 5 which cells communicate with each other, and ceUular conti-ol and feedback systems. 

Certain macromolecules are known to interact and bind to other molecules having a 
very specific three-dimensional spatial and electronic distribution. Any large 
molecule having such specifidty can be considered a receptor, whether the molecule 
is an enzyme catalyzing hydrolysis of a metaboUc intermediate, a cell-surface protein 

2 0 mediating membrane ti-ansport of ions, a glycoprotein serving to identify a particular 

cell to its neighbors, an IgG-dass antibody circulating in the plasma, an 
oUgonudeotide sequence of DNA in the genome, or the Uke. The various molecules 
that receptors selectively bind are known as ligands. 

Many assays are available for measuring the binding affinity of known 

2 5 receptors and ligands, but the information tiiat can be gained from such experiments 

is often limited by the number and type of available ligands. Novel ligands are 
sometimes discovered by chance or by application of new techniques for the 
eluddation of molecular stinicture, induding x-ray crystallographic analysis and 
recombinant genetic techniques for proteins. 

3 0 Small peptides are an exemplary system for exploring ihe relationship 

between structure and function in biology. A peptide is a polymer composed of 
amino add monomers. When the twenty natiirally occurring amino adds are 
condensed into polymeric molecules, the resulting polymers form a wide variety of 
three-dimensional configurations, each resulting from a partiailar amino add 
3 5 sequence and solvent condition. The number of possible pentapeptides of the 20 
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naturaUy occurring amino adds, for example, is 205 or 3^ miUion different peptides. 
The likelihood that molecules of this size might be useful in receptor-bindmg studies 
is supported by epitope analysis studies showing that some antibodies recognize 
sequences as short as a few amino adds with high spedfidty. Furthermore, the 
5 average molecular weight of amino adds puts small peptides in the size range of 
many currently useful pharmaceutical products. Of course, larger peptides may be 
necessary for many purposes, and polypeptides having dxanges in only a small 
number of residues may also be useful for sudi purposes as the analysis of structure- 
activity relationshqjs. 
1 0 Pharmaceutical drug discovery is one type of research that relies on 

studies of structure-activity relationships. In most cases, contemporary 
pharmaceutical researdi can be described as the process of discovering novel Ugands 
with desirable patterns of spedfidty for biologicaUy important receptors. Another 
example is research to discover new compounds for use in agriculture, such as 

1 5 pestiddes and herbiddes. 

Prior methods of preparing large numbers of different oUgomers have 
been painstaldngly slow when used at a scale suffident to permit effective rational or 
random screening. For example, the "Merrifield" method (Merrifield, J. Am. Chem. 
Soc 85:2149-2154 (1963), which is incorporated herein by reference) has been used 

20 to synthesize peptides on a solid support. In the Merrifield method, an amino add is 
covalentiy bonded to a support made of an insoluble polymer. Another ammo aad 
with an alpha protected group is reacted with the covalentiy bonded amino add to 
form a dipeptide. The protective group is removed, and a third amino add with an 
alpha protective group is added to the dipeptide. This process is continued until a 

2 5* peptide of a desired lengtii and sequence is obtained. Using the Merrifield metiiod, 

one cannot economically and practically syntiiesize more tiian a few peptide 

sequences in a day. 

To syntiiesize larger numbers of oUgomer sequences, otiiers have 
proposed tiie use of a series of reaction vessels for oUgomer synfliesis. For example, a 

3 0 tubular reactor system may be used to syntiiesize a linear oligomer on a soUd phase 

support by automated sequential addition of reagents. This metiiod stiU does not 
enable tiie synthesis of a suffidentiy large number of oUgomer sequences for effective 

economical screening. 

Metiiods of preparing a pluraUty of oUgomer sequences are also known 
3 5 in whidi a foraminous container endoses a known quantity of reactive soUd 
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supports^ the solid supports being larger in size than openings of the container. See 
U.S. Patent No. 4,631^11, incorporated herein by reference. The containers may be 
selectively reacted with desired materials to synthesize desired sequences of product 
molecules. As with other methods known in the art, this method cannot practically 
5 be used to synthesize a sufficient variety of polypeptides for effective screening. 

Other techniques have also been described. One bead-based method is 
described in PCT patent publication No. 92/00091, incorporated herein by reference. 
These methods include the synthesis of peptides on 96 plastic pins that fit the format 
of standard microtiter plates. See PCT patent publications 84/03564; 86/00991; and 
1 0 86/06487, each of which is incorporated herein by reference. Unfortunately, while 
these techniques have been somewhat useful, substantial problems remain. For 
example, these methods continue to be lunited in the diversity of sequences which 
can be economically synthesized and screened. 

Others have developed recombinant methods for preparing collections 

1 5 of oUgomere. See PCT patent pubUcation Nos. 91/17271 and 91/19818, each of which is 

incorporated herein by reference. In another important development, scientists 
combined the techniques of photolithography, chemistry, and biology to create large 
collections of oligomers and other compounds on the surface of a substrate. See U.S. 
Patent No. 5,143,854 and PCT patent pubUcation Nos. 90/15070 and 92/10092, each of 

2 0 which is incorporated herin by reference. 

In the recombinant and VLSIPS™ combinatorial methods, one can 
uniquely identify each oligomer in the library by determining the coding sequences in 
the recombinant organism or phage or by the location of the oligomer on the 
VLSIPS™ chip. In other methods, however, the identity of a particular oligomer may 

2 5 be difficult to ascertain. What is needed in these latter methods is an efficient and 

simple-to-use metiiod for tagging each particle. Although tagging methods have been 
developed for large objects, see PCT patent publication Nos. 90/14441 and 87/06383, 
each of which is incorporated herein by reference, such methods are still needed for 
combinatorial libraries of oligomers. 

3 0 From the above, one can recognize that improved methods and 

apparatus for synthesizing a diverse collection of chemical sequences would be 
beneficial. 

SUMMARY OF THE INVENTION 
3 5 The present invention provides a general stochastic method for 



wo 93/06121 



PCT/US92/07815 



synthesizing Iftraries of random oUgomers. The random oUgomers are synthesized 
on soUd supports, or particles, but may be cleaved from these supports to provide a 
soluble library. Hie oUgomers are composed of a sequence of monomers, the 
monomers being any member of the set of molecules that can be joined together to 
5 form an oUgomer or polymer, i.e., amino adds, carbamates, sulfones, sulfoxides, 
nucleosides, carbohydrates, ureas, phosphonates, Upids, esters, combinations of the 
same and the like. The Ubrary is then saeened to isolate individual oUgomers that 
bind 'to a receptor or possess some desired property. Each oUgomer sequence m the 
library is unique, in a preferred embodiment In another preferred embodmient, the 

1 0 soUd supports are nonporous beads. The soUd supports may be composed of a smgle 

particle, or two or more Unked particles. 

A further embodiment of the invention relates to the use of an identifier 
tag to identify the sequence of monomers in the oligomer. The identifier tag, which 
may be attached directly to the oUgomer with or without an accompanying particle, to 
15 a Unker attached to the oUgomer, to the soUd support upon which the oUgomer is 
synthesized, or to a second particle attached to the oligomer-canying particle, may be 
any recognizable feature tiiat in some way carries the required information, and that is 
decipherable at the level of one or a few soUd supports. Tlie soUd supports may be 
joined to the oUgomers and the identifier tag by means of one or more Unker 

2 0 molecules. , ..^ 

In a preferred embodiment, the identifier tag wiU be an oUgonudeotide, 

preferablv composed of pyrimidines or pyrimidines and purine analogs or any type of 
nudeoside that wiU not degrade under the coupUng conditions used to assemble the 
oUgomer Ubrary. TUe oUgonudeotide identifier tag may contain a 5* and a 3' 
9 5 ampUfication site, to aUow ampUfication of the tag by, for example, the polymerase 
diain reaction (see U.S. Patent Nos. 4,683,202; and 4,965,188, eadi of whidi is 
incorporated herein by reference). A DNA sequendng primer site, whidi may be 
spedfic for eadi step of the oUgomer synthesis, may also be induded in the 
oUgonudeotide tag in addition to the ampUfication primer sites. The tag may be 

3 0 designed to indude, in the oUgonudeotide sequence, information aUowing 

identification of the monomer assodated with the addition of tiie particular tag. The 
oUgonudeotide tag wiU be about 50 to 100 nudeotides in length, in a preferred 
embodiment. 

In another preferred embodiment, the identifier tag may be composed of 
3 5 a set of Ught-^ddressable compounds, sudi as fluorescent or phosphorescent 
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5 

compounds that can be photobleached, which compounds are incorporated into the 
beads or particles on which the oligomers of the oligomer library are synthesized. 
Such compounds are widely known in the art. 

5 BRIEF DESCRIPTION OF THE HGURES 

Fig- 1 is a schematic representation of combinatorial oligomer synthesis . 

on particles. 

Fig. 2 is a schematic representation of concurrent combinatorial oligomer 
synthesis and partide tagging. 
1 0 Fig- 3 describes one method of bead functionaiization, the compatible 

chemistries for peptide s)^thesis and round by round attachment of oligonucleotide 
identifier tags, including synthesis of amino-functionalized beads, the structure of 
protected 5' maieimidyl oligonucleotides, amino acid coupling and introduction of a 
thiol "handle/ step-specific oligonucleotide attachment to a bead, subsequent amino 

1 5 acid coupling(s) and oligonucleotide attachment(s), and peptide and oligonucleotide 

deprotection. 

Fig, 4 is a schematic representation of one example of an oligonucleotide 

tag. 

Fig. 5 illustrates nucleoside phosphoramidites derivatized with 
20 photolabile protecting groups for parallel peptide/oligonucleotide synthesis. 

Fig. 6 illustrates 5 -DMT-3 -(O-allyl N,N'-diisopropyl phosphoramidite) 
nucleoside derivatives for parallel peptide /oligonucleotide synthesis. 

Fig. 7 illustrates the preparation of a bifunctional bead material for 
parallel synthesis of peptides and oligonudeotides- 

2 5 Fig. 8 illustrates the parallel assembly of oligonudeotide-tagged peptides 

on beads- 
Fig. 9 shows a schematic representation of the experiment described in 
Example 5, in which two popidations of oligomers on beads are prepared, tagged, 
mixed, sorted, and identified by the method of the present invention. 

3 0 Fig. 10 shows resolution of the two populations of beads by FACS in the 

experiment described in Example 5. Values along the horizontal axis indicate 
relative fluorescence (log scale). Values along the vertical axis indicate relative 
numbers of beads. Non-fluorescently labeled beads are represented by peak A. 
Flourescentiy labeled beads are represented by peak B. The ratio of the larger peak to 
3 5 the smaller peak is 15:1. 
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Fig. 11 shows pictures of ethidium bromide stained, UV irradiated 
agarose gels of PGR products obtained by amplification after FACS of two bead 
populations and amplification of the tags on the sorted beads, with controls, as 
described in Example 5. Gel A shows the results with sorted fluorescent beads: lane 
5 1 - 14xl06 copies (100 bead equivalents) of 95 mer tag; lanes 2-7 - PGR product from 
single fluorescent beads; lanes 8-10 - PGR product from ten fluorescent beads; lanes 
11-13 - PGR product from one himdred fluorescent beads; and lane 14 - 1.2xl06 
copies (100 bead equivalents) of 110 mer tag. Gel B shows the result with sorted non- 
fluorescent beads: lane 1 - 1^10^ copies of 110 mer tag; lanes 2-7 - PGR product 
1 0 from single non-fluorescent b'eads; lanes 8-10 - PGR product firom ten non- 
fluorescent beads; lanes 11-13 - PGR product from one hundred non-fluorescent 
beads; and lane 14 - 2.4xl06 copies of 95 mer tag. Gel G shows the results with the 
control reactions: lanes 1,12 - DNA size standards; lanes 2, 3 - no tag control 
reactions; lanes 4, 5 - 1 bead equivalent of soluble 95 mer tag; lanes 6, 7 - 10 bead 

1 5 equivalents of soluble 95 mer tag; lanes 8, 9 - 1 bead equivalent of soluble 110 mer 

tag; and lanes 10, 11 - 10 bead equivalents of soluble 110 mer tag. 

DESGRIPnON OF THE SPEGIFIG EMBODIMENTS 
The present invention provides novel methods and instruments for 

2 0 producing large synthetic oligomer libraries- In a preferred embodiment of the 

present invention, each member of such a library is uniquely labeled in a manner that 
specifies the identity of the sequence of the oligomer corresponding to that member. 
Methods for screening such libraries and reagents useful for producing the libraries 
^ are also provided, 

25 

Glossary 

The following terms are intended to have the following general 
meanings as they are used hereiru 

Gomplementary or substantially complementary: These terms refer to 

3 0 base pairing between nucleotides or nucleic adds, such as, for instance, between the 

two strands of a double stranded DNA molecule or between an oligonucleotide 
primer and a primer binding site on a single stranded nudeic add to be sequenced or 
amplified- "Gomplementary" nudeotides are, generally, A and T (or A and U), and G 
and G, as is well known to those of skill in the art. Two single stranded RNA or DNA 
35 molecules are said to be "substantially complementary" when the nudeotides of one 
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strand, optimally aligned, pair with at least about 80% or more of the nucleotides of 
the other strand. 

Alternatively, substantial complementarity exists when an RNA or DNA 
strand will hybridize tmder selective hybridization conditions to a complementary 
5 nucleic add. T3rpically, selective hybridization will occur when tiiere is at least about 
55% complementarity over a stretch of at least 14 to 25 nucleotides, but more selective 
hybridization will occur as complementarity increases to 65%, 75%, 90%, and 100%. 
See Kanehisa, Nucleic Acids Res. 12 :203 (1984), incorporated herein by reference. 

Stringent hybridization conditions will typically include salt 
1 0 concentrations of less than about 1 M, such as less than 500 mM, and will often 

include salt concentrations of less than 200 mM. The hybridization temperature for 
oligomers will typically be greater than 22**C, such as greater than about SO^^C, and will 
often be in excess of about 37°C. Longer fragments may require higher hybridization 
temperatures for specific hybridizatioa As other factors may dramatically affect the 

1 5 stringency of hybridization (such factors include base composition, length of the 

complementary strands, presence of organic solvents, and extent of base 
mismatching), the combination of factors is more important than the absolute 
measure of any one factor alone. 

Epitope : The portion of an antigen molecule delineated by the area of 

2 0 interaction with the subclass of receptors known as antibodies is an "epitope." 

Identifier tag : An "identifier tag" is a physical attribute that provides a 
means whereby one can identify which monomer reactions an individual solid 
support has experienced in the sjmthesis of an oligomer. The identifier tag also 
records the step in the synthesis series in which the solid support visited that 

2 5 monomer reaction. The identifier tag roay be any recognizable feature, including for 

example: a microscopically distinguishable shape, size, color, optical density, etc.; a 
differential absorbance or emission of light; chemically reactivity; magnetic or 
electronic encoded information; or any other distinctive mark with the required 
information, and decipherable at the level of one (or a few) solid support(s). A 

3 0 preferred example of such an identifier tag is an oligonucleotide sequence. An 

"identifier tag" can be coupled directiy to the oligomer synthesized, whether or not a 
solid support is used in the syndesis. In this latter embodiment, the identifier tag 
serves as the "support" for oligomer synthesis. 

Li gand: A "ligand" is a molecule that is recognized by a particular 
3 5 receptor. The agent bound by or reacting with a receptor is called a "ligand", a term 
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which is definitionally meaningful only in tenns of its counterpart receptor. Ihe 
term "Ugand" does not imply any particular molecular size or other ° 
composLnal feature other than that the substance in question xs capable of bm^g 
or o^erwise interacting with the receptor. Also, a "Hgand" may serve aU^as 
natural ligand to which the receptor binds, or as a functional analogue that may art as 
an agonist or antagonist Ugands that can be investigated by ti^s invenhon md^ 
but L not restricted to, agonists and antagonists for ceU membrane 
and venoms, viral epitopes, hormones, sugars, cofactors, peptides, enzyme substrates, 
cofartors, drugs (e.g., opiates, steroids, etc.), and proteins. 

Monomer: A "monomer" is any member of the set of molecules which 
can be joined together to form an oUgomer or polymer. The set of ™- 
in the present invention includes, but is not restiicted to, for the example of peptide 
syntitesis, the set of L-amino adds, D-amino adds, or synthetic amino aads. As us^d 
h^ein, "monomer" refers to any member of a basis set for synthesis of an ohgomer. 
For example, dimers of L-amino adds form a basis set of 400 "monom^s for 
syntixesis of polypeptides. Different basis sets of monomers maybe used at successive 

steps in the synthesis of a polymer. 

oK ^^.r nr Polvmer: The "oUgomer" or "polymer" sequences of tiie 
present invention are formed from tiie diemical or enzymatic addition of mom^mer 
submdts. Sudt oHgomers indude, for example, botii linear, cydic, and brandted 
polvmers of nudeic adds, polysacdiarides, phospholipids, and peptides havmg eiti^er 
alpJia-, beta-, or omega-amino adds, heteropolymers, polyuretiianes polyesters, 
polycarbonates, polyureas, polyamides, polyetiiyleneimines, P^^'y^j'^^^^f^' 
- polvsiloxanes, polyimides, polyacetates, or other polymers, as will be readily apparent 
to one sldUed in the art upon review of this disdosure. 

Peptide- A "peptide" is an oUgomer in whidi tiie monomers are alpha 
amino adds joined together through amide bonds. Alternatively, a "peptide" can be 
referred to as a "polypeptide." In tiie context of this spedfication, one should 
appredate that tiie amino adds may be tiie L-optical isomer or the I>optical isomer. 
Peptides are more than two amino add monomers long, but more often are more 
th^5 to 10 amino add monomers long and can be even longer than 20 amino acids, 
although peptides longer than 20 amino adds are more likely to be called 

"polypeptides." Standard single letter abbreviations for amino adds are used (e.g., F 
for proline). These abbreviations are induded in Stryer, lis^semisJiy, Third Ed. 
(1988), which is incorporated herein by reference. 
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Oli gonucleotides : An "oligonucleotide" is a single-stranded DNA or 
RNA molecule, typically prepared by synthetic means. Those oligonucleotides 
employed in the present invention will usually be 50 to 150 nucleotides in length, 
preferably from 80 to 120 nucleotides, although oligonucleotides of different length 
5 may be appropriate in some circumstances- For instance, in one embodiment of the 
invention, the oligonucleotide tag and the polymer identified by that tag are 
synthesized in parallel. In this embodiment, the oligonucleotide tag can be built 
nucleotide-by-nucleotide in coordination with the monomer-by-monomer addition 
steps used to synthesize the oligomer. In addition, very short, i.e., 2 to 10 nucleotides, 
1 0 oligonucleotides may be used to extend an existing oligonucleotide tag to identify a 
monomer coupling step. Suitable oligonucleotides may be prepared by the 
phosphoramidite method described by Beaucage and Carruthers, Tetr . Lett. 22:1859- 
1862 (1981), or by the triester method, according to Matteucd et al./ T. Am. Chem, Soc. 
103:3185 (1981), both incorporated herein by reference, or by other methods such as by 

1 5 using commercial automated oligonucleotide synthesizers. 

Operably linked : A nucleic add is "operably linked" when placed into a 
functional relationship with another nudeic add sequence. For instance, a promoter 
or enhancer is "operably linked" to a coding sequence if the promoter causes the 
transcription of the sequence. Generally, operably linked means that the DNA 

2 0 sequences being linked are contiguous and, where necessary to join two protein 

coding regions, contiguous and in reading frame. 

Receptor: A "receptor" is a molecule that has an affinity for a given 
ligand. Receptors may be naturally-occurring or marunade molecules. Also, receptors 
can be employed in their unaltered natural or isolated state or as aggregates with other 

2 5 spedes. Receptors may be attached, covalently or noncovalently, to a binding 

member, either directly or via a specific binding substance. Examples of receptors that 
can be employed in the method of the present invention indude, but are not 
restricted to, antibodies, cell membrane receptors, monodonal antibodies, antisera 
reactive with specific antigenic determinants (such as on viruses, cells, or other 

3 0 materials), polynudeotides, nudeic adds, lectins, polysaccharides, cells, cellular 

membranes, and organelles. Receptors are sometimes referred to in the art as "anti- 
ligands." As the term "receptor" is used herein, no difference in meaning is intended. 
A "ligand-receptor pair" is formed when two macromolecules have combined 
through molecular recognition to form a complex. 
3 5 Other examples of receptors that can be investigated by this invention 
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include, but are not restricted to; 

T^irrnnr^anism receptors: Determination of Ugands that bind to 
miaoorganism receptors, such as specific transport proteins or enzymes essential to 
survival of microorganisms, is useful in discovering new classes or types of 
5 antibiotics. Of particular value would be antibiotics against opportunistic fungi, 
protozoa, and those bacteria resistant to the antibiotics in current use. 

Enzymes : For instance, the binding site of any enzyme, such as the 
enzymes responsible for cleaving neurotransmitters, is a receptor. Determination of 
ligands that bind to certain enzymes, and thus modulate the action of the enzymes 
1 0 that cleave the different neurotransmitters, is useftil in the development of drugs that 
can be used in the treatment of disorders of neurotransmission. 

Antibodies: For instance, the invention may be useful in investigating 
the ligand-binding site on an antibody molecule that combines with the epitope of an 
antigen of interest Determining a sequence that mimics an antigenic epitope may 

1 5 lead to the development of vacdnes or lead to the development of related diagnostic 

agents or compounds useful in therapeutic treatments such as for autoimmune 
diseases (e.g., by blocking the binding of tiie "self" antibodies). 

Miideic Acids: The invention may be useful in investigating sequences 
of nucleic adds acting as binding sites for ceUular proteins ("trans-acting factors"). 

2 0 Such sequences may indude, e.g., enhancers or promoter sequences. 

ratalvtic Polyppptides: Polymers, preferably polypeptides, which are 
capable of promoting a chemical reaction involving tiae conversion of one or more 
reactants to one or more products are "catalytic polypeptides." Such polypeptides 
~ generally indude a binding site specific for at least one reactant or reaction 

2 5 intermediate and an active functionaHty proximate to tiie binding site, which 
functionality is capable of chemically modifying tiie bound reactant. Catalytic 
polypeptides are described in, Lemer ej al., §sesc£2S: 659 (1991), whidi is 
incorporated herein by reference. 

Wnrmnne receptors: For instance, "hormone receptors" indude the 

30 receptors for insulin and growtii hormone. Determination of tiie Kgands which bind 
with high affinity to a hormone receptor is useful in the development of, for example, 
an oral replacement of tiie daily injections which diabetics must take to relieve the 
symptoms of diabetes, and in die other case, a replacement for human growth 
hormone, which can only be obtained from cadavers or by recombinant DNA 

35 technology. Other examples indude the vasoconstrictive hormone receptors; 
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determination of ligands that bind to those receptors may lead to the development of 
drugs to control blood pressure. 

Opiate receptors : Determination of ligands that bind to the opiate 
receptors in the brain is useful in the development of less-addictive replacements 
5 for morphine and related drugs. 

Substrate or Solid Support : A "substrate or solid support" is a material 
having a rigid or semi-rigid stirface. Such materials will preferably take the form of 
small beads, pellets, disks, or other convenient forms, although other forms may be 
used. In some embodiments, at least one surface of the substrate will be substantially 

1 0 flat. A roughly spherical shape is preferred. 

Synthetic : A compound is "synthetic" when produced by in vitro 
chemical or enzymatic synthesis. The sjmthetic libraries of the present invention may 
be contrasted with those in viral or plasmid vectors, for instance, which may be 
propagated in bacterial, yeast, or other living hosts. 

15 

L Method for Producing Large Synthetic Oligomer Libraries 

A general method of random oligomer s)mthesis is provided by the 
present invention. The method can be used to produce the enormous ntunbers of 
compounds available with recombinant systems and to utilize the monomer set 

2 0 diversity available with chemical S3mthesis methods. By means of the present 

method, one can readily produce up to 1012 different oligomers, a dramatic 
improvement over previous methods. The invention also provides a facile means of 
oligomer identification. 

The general method comprises producing a large, highly diverse 

2 5 collection or library, each member of such a library comprising a single oligomer 

sequence (e.g., a peptide). The sequence may be soluble or may be bound to a solid 
support. When bound to a solid support, the oligomer is usually attached by means of 
a linker. The linker, prior to attachment, has an appropriate functional group at each 
end, one group appropriate for attachment to the support and the other group 

3 0 appropriate for attachment to the oligomer. Such a collection may contain, for 

example, all combinations of n monomers assembled into X length oligomers 
yielding, nX different compoimds. The collection may also contain oligomers having 
different monomer units at, for example, only one or a small number of positions, 
while having an identical sequence at all other positions. The general method 
3 5 typically involves synthesizing the oligomers in a random combinatorial 
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("Stochastic") fashion by chemical and/or enzymatic assembly of monomer units. 

A synthetic oUgomer Ubrary may be produced by synthesizing on each of 
a plurality of solid supports a single oUgomer sequence, the oligomer sequence being 
different for different soUd supports. The oUgomer sequence is synthesized in a 
5 process comprising the steps of: (a) apportioning the supports in a stochastic manner 
among a plurality of reaction vessels; (b) exposing the supports in each reaction vessel 
to a first monomer; (c) pooling the supports; (d) apportioning the supports in a 
stochastic manner among the pluraUty of reaction vessels; (e) exposing the supports in 
each reaction vessel to a second monomer; and (f) repeating steps (a) through (e) from 

1 0 at least one to twenty times. TypicaUy, substantially equal numbers of soUd supports 

will be apportioned to each reaction vesseL In one embodiment of the method, the 
monomers are chosen from the set of amino adds, and the resulting oligomer is a 
peptide. 

As a specific example of the method, one may consider the synthesis of 
1 5 peptides three residues in length, assembled from a monomer set of three different 
monomers: A, B, and C. The first monomer is coupled to three different aUquots of 
beads, each different monomer in a different aHquot, and the beads from all the 
reactions are then pooled (see Fig. 1). The pool now contains approximately equal 
numbers of three different types soUd supports, with each type characterized by the 

2 0 monomer in the first residue position. The pool is mixed and redistributed to the 

separate monomer reaction tubes or vessels containing A, B, or C as the monomer. 

The second residue is coupled. 

Following this reaction, each hibe now has beads with three different 
^ monomers in position one and the monomer contained in each particular second 

2 5 reaction tube in position 2. AU reactions are pooled again, producing a mixture of 

beads each bearing one of the nine possible dimers. The pool is again distributed 
among the .three reaction vessels, coupled, and pooled. This process of sequential 
synthesis and mixing yields beads that have passed through all the possible reaction 
pathways, and the collection of beads displays aU trimers of three amino adds (33 = 
30 27). Thus, a complete set of the teimers of A, B, and C is constructed. As can be readily 
appredated, the use of a suffidently large number of synthesis beads helps to ensure 
that the set completely represents tiae various combinations of monomers employed 
in this random, combinatorial synthesis scheme. 

This mettiod of assembling oligomers firom many types of monomers 

3 5 requires using the appropriate coupling chemisby for a given set of monomer units 
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or building blocks. Any set of building blocks that can be attached to one another in a 
step-by-step fashion can serve as the monomer set. The attachment may be mediated 
by chemical, enzymatic, or other means, or by a combination of any of these means. 
The resulting oligomers can be linear, cyclic, branched, or assume various other 
5 conformations as will be apparent to those skilled in the art. Techniques for solid 
state synthesis of polypeptides are described, for example, in Merrifield, supra. Peptide 
coupling chemistry is also described in The Peptides. Vol. 1 (eds. Gross, E., and J. 
Meienhofer, Academic Press, Orlando (1979)), which is incorporated herein by 
reference. 

10 To synthesize the oligomers, a collection of a large number of the solid 

supports is apportioned among a number of reaction vessels. In each reaction, a 
different monomer is coupled to the growing oligomer chairu The monomers may be 
of any type that can be appropriately activated for chemical coupling or accepted for 
enzymatic coupling. Because the reactions may be contained in separate reaction 

1 5 vessels, even monomers with different coupling chemistries can be used to assemble 

the oligomers (see The Peptides, supra V The coupling time for some of the monomer 
sets may be long. For this reason, the preferred arrangement is one in which the 
monomer reactions are carried out in parallel. After each coupling step, the solid 
supports on which are synthesized the oligomers of the library are pooled and mixed 
20 prior to re-allocation to the individual vessels for the next coupling step. This 
shuffling process produces solid supports with many oligomer sequence 
combinations. If each synthesis step has high coupling effidehcy, then substantially 
all the oligomers on a single solid support have the same sequence. That sequence is 
determined by the synthesis pathway (t5rpe and sequence of monomer reactions) for 

2 5 any given solid support at the end of the sjmthesis. The maximum length of the 

oligomers is typically less than about 20, usually from 3 to 8 residues in length, but in 
some cases a length of 10 to 12 residues is preferred. Protective groups known to those 
skilled in the art may be vised to prevent spurious coupling (see The Peptides, Vol. 3 
(eds. Gross, E., and J. Meienhofer, Academic Press, Orlando (1981), which is 

3 0 incorporated herein by reference). 

Modifications of this completely random approach are also possible. For 
example, the monomer set may be expanded or contracted from step to step; or the 
monomer set could be changed completely for the next step (e.g., amino adds in one 
step, nudeosides in another step, carbohydrates in another step), if the coupling 
3 5 chemistry were available (see Gait, Oli gonudeotide Synt hesis: A Practical Approach. 
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IRL Press, Oxford (1984); Friesen and Danishefsky, T. Amer. Chem. Soc. 111:6656 (1989); 
and Paulsen, a. ri..n, Tnh Ed. Engl 25:212 (1986), all of which are incorporated 
herein by reference). A monomer unit for peptide synthesis, for example, may 
include single amino adds or larger peptide units, or both. One variation is to form 
5 several pools of various sequences on soUd supports to be distributed among different 
monomer sets at certain steps of the synthesis. By this approach, one can also build 
oUgomers of different lengths with either related or unrelated sequences, and one can 
fix certain monomer residues at some positions whUe varying the other residues, to 
construct oligomer frameworks wherein certain residues or regions are altered to 

1 0 provide diversity. 

The chemical or enzymatic synthesis of the oUgomer Ubraries of the 
present invention typically takes place on soUd supports. The term "soUd support" as 
used herein embraces a particle witii appropriate sites for oUgomer synthesis and, in 
some embodiments, tag attachment and/or synthesis. There are various soUd 
1 5 supports useful in preparation of the synthetic oUgomer Ubraries of the present 
invention. SoUd supports are commonly used for soUd phase synthesis of, for 
example, peptides and nucleic adds and otiier oUgomers as enumerated above, and 
thus are weU known to those skilled in the art. 

With enough soUd supports and effident coupUng, one can generate 

2 0 complete sets of certain oUgomers, if desired. In general, the soUd support size is in 

the range of 1 nm to 100 fim, but a more massive soUd support of up to 1 mm in size 
may be used. The appropriate size of the soUd support depends on (1) the number of 
oUgomer synthesis sites and identifier tag attachment sites desired; (2) the number of 
different compounds to be synthesized (and tiie number of soUd supports bearing eadi 
25 ^ oUgomer that are needed for screening); and (3) ti:ie effect of the size of the soUd 
supports on the spedfic screening strategies [e.g., fluorescence-activated ceU sorters 

(FACS)] to be used. 

As a spedfic example, soUd supports of 1 nm in diameter may be used. H 
each reaction contains approxhnately 0.2 mL of soUd supports, and the oligomers are 

3 0 synthesized from a set of 50 monomers (50 paraUd reactions), then a total of 10 mL of 

soUd supports, or approximately 10l3 soUd supports, would be required. If one wishes 
to make hexamers with these 50 monomers, then there are over 1.5 x lOlO possible 
sequences, and each spedfic sequence would be represented on about 103 soUd 
supports. An estimated capadty of each bead, based on tiie capadty of commonly used 
3 5 peptide synthesizing resins, is about 0.1 pg of peptide per bead. By this estimation. 
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then, each soUd support woiiid have about 100 amoi or 108 oUgoma- chains. 

To improve washing effidendes, solid supports less porous than typical 
peptide synthesis resins are preferable. These supports will have a lower density of 
growing chains, but even with a deaease in capadty of several orders of magnitude, 
5 suffident oligomer densities can be produced for effident saeening. With the less 
porous supports, a greater proportion of the oligomers will be accessible for binding to 
the receptor during the screening process. Also, the less porous supports will reduce 
the carryover of tags from one reaction to the next, thus improving the accuracy of 
reading the dominant (correct) tags. 
1 0 Such soUd supports may be of any shape, although they will preferably be 

roughly spherical. The supports need not necessarily be homogenous in size, shape, 
or composition; although the supports usually and preferably wiU be uniform. In 
some embodiments, supports that are very uniform in size may be particularly 
preferred. In another embodiment, however, two or more distincdy different 

1 5 populations of solid supports may be used for certain purposes. 

Solid supports may consist of maiiy materials, limited primarily by 
capadty for derivatization to attach any of a number of chemically reactive groups and 
compatibility with the chemistry of oligomer synthesis and tag attachment. Suitable 
support materials indude glass, latex, heavUy aoss-linked polystyrene or similar 

2 0 polymers, gold or other coUoidai metal partides, and other materials known to those 

skilled in the art. Except as otherwise noted, the chemically reactive groups with 
which such solid supports may be derivatized are those commonly used for solid state 
synthesis of the respective oligomer and thus will be well known to those skilled in 
the art. The solid supports of the present invention do not indude living cells, 

2 5 viruses, or doning vedors such as phage vedors or plasmids. 

n. Method for Produdng Tagged Synth etic Oligomer Libraries 

In a preferred embodiment of the invention, the oligomers comprising 
the library also are attached to an identifier tag tiiat can be easily decoded to report Mie 

3 0 sequence of each oligomer. The identifier tags may be attached dther to the oligomer 

or to the soUd support to which the oligomer is attached. The attachment is preferably 
by means of a linker that, prior to attachment, has an appropriate functional group at 
each end, one group appropriate for attachment to the support and the other group 
appropriate for attachment to the identifier tag. Alternatively, the identifier tag may 
3 5 be attached to a monomer incorporated into the oligomer or attached directly to the 
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same linker that binds the oligomer to the soUd support In the latter embodiment, 
the linker has, prior to attachment, a third functional group appropriate for the 
attachment of the identifier tag. 

A synthetic oligomer library that incorporates identifier tags is produced 
5 by synthesizing on each of a plurahty of solid supports a single oligomer sequence and 
one or more identifier tags identifying the oUgomer sequence. The tagged synthetic 
oligomer library is synthesized in a process comprising the steps of: (a) apportioning 
the supports among a plurality of reaction vessels; (b) exposing the supports in each 
reaction vessel to a first oHgomer monomer and to a first identifier tag monomer; (c) 

1 0 pooling the supports; (d) apportioning the supports among a plurahty of reaction 

vessels; and (e) exposing the supports to a second oligomer monomer and to a second 
identifier tag monomer. As noted above, one can also practice the invention in a 
mode in which there is no soUd support; in tiiis mode, the tag is attached directly to 
the ohgomer being synthesized. The steps of eitiier process typically will be repeated 
15 one or more times, but usually, will be repeated less than 20 times. 

The solid supports can be exposed to (or coupled witii) an oligomer 
monomer and an identifier tag at the same time, or sequentially. In either event, tiie 
supports are then pooled and eqjosed to ihe second oligomer monomer and second 
identifier tag. As before, tinese steps are repeated, typically from one to about 20 times. 

2 0 The invention is described herein primarily with regard to the preparation of 

molecules mntaining sequences of amino adds, but ihe invention can readily be 
applied to the preparation of otiier oligomers and to any set of compounds tiiat can be 
synthesized in a component-by-component fashion, as can be appreciated by those 
skilled in the art 

25' In anotiier embodiment tiie same soUd support is used for syntiiesizing 

an members of the library, but tiie members are cleaved fi-om the support prior to 
screening. In tiiis embodiment syntiiesis of tagged oligomers may be accomplished 
utilizing very large scale immobilized polymer syntiiesis (VLSIPS™) techniques. See 
U.S. Patent No. 5,143,854 and PCT patent publication No. 92/10092, each of which is 

3 0 incorporated herein by reference. An array of oligonucleotides is synthesized on the 

VLSIPS™ chip, each oligonucleotide linked to the chip by a deavable group such as a 
disulfide. In one embodiment each oligonudeotide tag has an amine group at the 
fiee end and only contains pyrimidine or pyrimidine and purine analog bases. In 
addition eadi oligonudeotide contains binding sites for amplification, i.e., PGR 
3 5 primer sites and optionally a sequencing primer site. A short section of each 



wo 93/06121 



PCr/US92/07815 



17 

oligonucleotide uniquely codes the monomer sequence of the oligomer to be tagged. 
Then, e.g., peptides are synthesized, optionally from the free terminal amine groups 
on each oligonucleotide, so that each peptide is linked to a tag. The whole collection 
of oligonucleotide-peptide may be released from tiie chip to create a soluble tagged 

5 oligomer library. 

More preferably, however, the oligomer library is constructed on beads or 
particles. One method of bead functionalization, with compatible chemistries for ^ 
peptide synthesis and round by round attachment of oligonucleotide identifier tags, is 
shown in Figs. 3.1-3.6. Glass beads are derivatized using aminopropyltriethoxysilane 
1 0 and a beta-alanine spacer group is coupled using activated ester methodology. The 
oligonucleotide tags may optionally incorporate a biotin group to facilitate 
purification, hybridization, amplification, or detection (see Fierce 
ImmunoTechnologv Catalog and Handbook, 1991, incorporated herein by reference). 
Commercially available Fmoc protected amino adds and standard BOP coupling 

1 5 chemistry is employed for peptide synthesis (see The Peptides, supra). Protected 

polypyrimidine (e.g., cytidine protected as N^-Bz-C) and/or purine analog containing 
oligonucleotides resistant to the coupling and deprotection reagents used in peptide 
synthesis are attached using maleimide chemistry to immasked thiol groups 
incorporated into growing peptide chains at low frequency (i.e., 0.1%) as cysteine 

2 0 residues witii masked thiol groups (which masks may be selectively removed prior to 

tagging). In other embodiments of the invention, one may not need to use protected 
nucleosides or oligonucleotides. 

However, to maintain the integrity of an oligonucleotide tag during 
peptide synthesis, one may need to use different combinations of protecting groups 

2 5 and /or synthetic nucleotides to avoid degradation of the tag or the oligomer 

synthesized. In general, polypyrimidine oligonucleotide tags are relatively stable 
under typical peptide synthesis conditions, as opposed to oligonucleotide tags that 
contain natural purine nucleotides, but a polypyrimidine nucleotide tag may be 
somewhat refractory to amplification by PCR. One may need to incorporate purine 

3 0 bases, or analogs tested for ability to withstand peptide coupEng (and deprotection) 

conditions, into the tag to acheive a desired efficiency of amplification. For purposes 
of the present invention, the tag optionally may contain from 10 to 90%, more 
preferably 35 to 50%, and most preferably 33 to 35%, purine or purine analog 
nucleotides. The oligonucleotides optionally may contain phosphate protecting 
3 5 groups (e.g., O-methyl phosphates) witii greater base stability than the standard beta- 
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cyanoethyl group, which may be susceptible to piperidine cleavage. In such cases, 
peptide and aUgonudeotide deprotection can be effected by sequential treatment with 
tfaiophenol, trifluoroacetic add, and ethanoUc ethylenediamine at 55 degrees C 
In another embodiment, photolabile aipha-aniino protecting groups are used in 
5 conjunction with base-labile side chain protecting groups for the amino adds, and 
standard beta-cyanoethyi protecting groups are used for the oUgonudeotide tags. 

In another embodiment, oligonudeotides containing both modified or 
synthetic purines and pyrimidines may be synthesized in paraUel with peptides using 
conventional Fmoc/tBu protected amino adds. In this method, one can also use O- 
1 0 allyl and N-aUyloxycarbonyl groups to provide protection for phosphate oxygens and 
the exocydic amines of the nudeoside bases, respectively (see Hayakawa etalv I- 
Amer. Chem. Soc. 112: 1691-1696 (1990), incorporated herein by reference). By 
employing the mild oxidant tBuOOH for oxidation at the phosphorous, one can 
n^iTii-TTiiyp oxidation of the amino adds methionine, tryptophan, and histidine (see 

1 5 Hayakawa et Teto. Lett. 27:4191-4194 (1986), incorporated herein by reference). Use 

of pyiidinium hydrochloride/imidazole as a phosphporamidite activator leads to 
sdective 5 -O-phosphitylation at the expense of low levels of spurious reaction at 
nitrogen on the peptide or oligonucleotide (see Gryaznov and Letsinger, Nudeic 
y^rids Research 20: 1879-1882 (1992) incorporated herein by reference). The lability of 

2 0 purine nudeotides to strong add (e.g., TFA) is avoided by use of phosphoramidites of 

the purine nudeoside analogs 7-deaza-2'-deoxyadenosine and 7-deaza-2'- 
deoxyguanosine (see Barr p<- al.. BioTechnigues 4:428-432 (1986), and Scheit, Nudeotide 
Analo gs: SvPthp-ris and Biological Function pp. 64-65 (John WUey and Sons, New 
York), both incorporated herdn by refaence). 

2 5 ' The fully assembled peptide and oUgonudeotide chains may be 

deprotected by first treating the products with 30% piperidine in DMF to remove 
amino-tenninal Fmoc groups. Then, the allyUc protecting groups are removed using 
THF containing tris (dibenzyiideneacetone) dipalladium-chloroform complex, 
tiiphenylphosphine, and n-butylamine/formic add, foUowed by a THF wash, an 

3 0 aqueous sodium N,N-diethyldithiocarbamate wash, and a water wash. Finally, the 

add-labile amino add protecting groups are removed by treatment widi 95:5 
TFA/water. 

Other methods also provide efective orthogonal protection during die 
paraUd assembly of oligonudeotides and peptides. These methods indude use of 
3 5 add-labile protecting groups on phosphates and exocydic amines of deoxycytidine, 7- 
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deaza-deoxyadenosine, and 7-deaza-deoxyguanosine suffidentiy robust to resist the 
3% trichloroacetic add used in 5'-0-detritylation; use of photochemically removable 
protecting groups on these residues; and combinations of such add and photoiabile 
groups (for photoiabile protecting groups for phosphate, see Baldwin et al., Tetr. Lett. 
5 46: 6879-6884 (1990), incorporated herein by reference; see also figure 5). 

EL Identifying the Sequence of any Oligomer 

The present invention provides a method for identif jdng the 
composition and sequence of any of the oligomers in the library. By tracking the 

1 0 synthesis pathway that each oligomer has taken, one can deduce the sequence of 

monomers of any oligomer. The method involves linking an identifier tag to the 
oligomer that indicates the monomer reactions and corresponding step numbers that 
define each oligomer in the library. After a series of synthesis steps (and concurrent 
identifier tag additions), one "reads" the identifier tag(s) assodated with an oligomer 
1 5 to determine the sequence of that oligomer. 

For example, one might attach microscopically recognizable, 
alphanumeric tags to each bead (see Fig. 2): "Al" means that the bead partidpated in 
the A-monomer reaction at step 1, "CZ" means that the bead partidpated in tixe C- 
monomer reaction at step 2, and "B3" means B-monomer was added in step 3, and so 

2 0 on. At the end of the 3-step syntiiesis, the bead would have tiiree tags attached, e.g., 

Al, C2, and B3, indicating that the sequence of the peptides on the bead is ACB. This 
scheme requires a number of distinct identifier tags equal to at most the product of the 
number of different monomers and the number of synthesis steps (nine in tiiis 
example). The number of identifier tags is reduced if the symbols are attached to one 

2 5 another in the order of Hie steps: A, A-C, A-C-B. In this case only as many identifier 

tags are needed as monomers. One builds the identifier tag in much the same way as 
the peptides, so as to preserve a record of what was monomer was added, and in 
which addition step. 

The identifier tags therefore identify each monomer reaction that an 

3 0 individual library member or solid support has experienced and record the step in the 

synthesis series in which each monomer is added. The tags may be attached 
immediately before, during, or after the monomer addition reaction, as convenient 
and compatible with the type of identifier tag, modes of attachment, and chemistry of 
oligomer synthesis. The identifier tag is added when the solid supports that have 
3 5 tmdergone a specific monomer addition step are physically together and so can be 
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taeeed as a group, Le., prior to the next pooling step. 

m some cases, of course, when only a smaU number of monomer units 
of an oUgomer are varied, one may need to identify only those monomers which vary 
among the oligomers, as when one wants to vary only a few amino adds m a peptide. 
5 For instance, one might want to change only 3 to 6 amino adds in peptides 6 to 12 
amino adds long, or one might want to diange as few as 5 amino aads m 
polypeptides up to 50 amino adds long. One may mdqueiy identify the sequence of 
eadi peptide by providing for eadi solid support an identifier tag spedfying only ^e 
amino adds varied in eadi sequence, as will be readily appredated by those skiUed m 

10 the art. m sudi cases, all soKd supports may remain in the same reaction vessel for 
the addition of common monomer units and apportioned among different reaction 
vessels for the addition of distinguishing monomer units. 

The identifier tag can be assodated with the oUgomer through a variety 
of medianisms, dther directiy, through a linking molecule, or through a soUd 

1 5 support upon whidi the oligomer is synthesized. In tiie latter mode, one could also 
attadi the tag to another soUd support that, in turn, is bound to ttie soUd support 
upon which the oligomer is synthesized. 

IV. Typps of Id< »nrifier Tags 
20 The identifier tag may be any recognizable feature tiiat is, for example: 

microscopically distinguishable in shape, size, color, optical density," etc; differentiy 
absorbing or emitting of light; chemically reactive; magnetically or eledromcaily 
encoded; or in some other way distinctively marked with the required information, 
and dedpherable at the levd of one (or few) soUd supports. In one embodiment, eadi 

2 5 " bead or other solid support in the Ubrary incorporates a variety of fluorophores, or 

other Hght addressable type of molecules, the spedral properties of which can be 
dianged and tiierefore used to store information. In one sudi mode, a bead 
incorporates a variety of fiuorophors, eadi of whidi can be selectively photobieadied, 
and so rendered incapable of fluorescence or of diminished fluoresence. Durmg eadi 

3 0 coupling step, the bead is irradiated (or not) to photobleadi (or not) one or more 

particular types of fiuorophors, thus recording the monomer identity in the oUgomer 
synthesized. See Sdoice 255: 1213 (6 Mar. 1992), incorporated herein by reference. 

One can consti^ctmiaoscopically identifiable tags as smaU beads of 
recognizably different sizes, shapes, or cotors, or labded with bar codes. Hie tags can 
3 5 be "madune readable" luminescent or radioactive labds. The identifier tag can also 
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be an encodable molecular structure. The information may be encoded in the size 
(eg. length of a polymer) or the composition of the molecule. The best example of 
this latter type of tag is a nucleic add sequence, i.e., RNA or DNA assembled from 
natural or modified bases. 
5 Synthetic oligodeoxyriboriucleotides are especially preferred 

information-bearing identifier tags. Oligonucleotides are a natural, high density 
information storage medium. The identity of monomer type and the step of addition 
is easily encoded in a short oligonucleotide sequence and attached, for example, to 
each peptide synthesis bead. When a single bead is isolated by saeening, e.g., for 

1 0 receptor binding, the attached oligonucleotides can be amplified by methods such as 

PGR (see PGR Protocols: A Guide to Methods and Applications. (Innis, M, Gelfand, D., 
Sninsky, J. and White, T., Academic Press, San Diego 1990), incorporated herein by 
reference), or by other nucleic add amplification techniques, such as the ligase chain 
reaction and the self-sustained sequence replication system. The amplified product 
15 can be easily sequenced or otherwise identified to decode the identity of the peptide on 
the bead. For this purpose, one can use any of a variety of sequencing methods, 
induding sequencing by sequence-spedfic probe hybridization. 

Alternatively, the information may be encoded in the length rather than, 
or in addition to, the sequence of the oligonudeotide. If only oligonudeotide length 

2 0 is utilized to represent each specific monomer addition to the oligomer, then the 

identity of the oligomer can be decoded by amplifying the oligonudeotide, as described 
above, and identifying the labels through any of a variety of size-separation 
techniques, induding polyacrylamide gel dectrophoresis or capillary electrophoresis. 

There are several ways that oligonudeotides can be used as identifier 

2 5 tags. The oligonudeotides can be assembled base-by-base before, during, or after the 

corresponding oligomer (e.g., peptide) synthesis step. In one case of base-by-base 
synthesis, the tag for each step is a single nudeotide, or at most a very few nudeotides 
(i.e., 2 to 5). This strategy preserves the order of the steps in the linear arrangement of 
the oligonudeotide chain grown in paralld with the oligomer. To preserve the 

3 0 chemical compatibility of the paralld synthetic steps (oligonudeotides and peptides, 

for example), one can modify the standard synthesis chemistries.- 

One variation of base-by-base assembly is the block-by-block approach; 
encoded sets of nudeotides ("codons") of 5 to 10 or more bases are added as protected, 
activated blocks. Each block carries the monomer-type information, and the order of 
3 5 addition represents the order of the monomer addition reaction. Alternatively, the 
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block may encode the oligomer synthesis step number as weE as the monomer-type 
information. 

One could also attach protected (or unprotected) oUgonudeotides 
containing amplification primer sites, monomer-spedfic information, and order-of- 
reaction information, from 10 to 50 to 150 bases in length, at each step. At the end of a 
series of n oUgomer synthesis steps, there would be n differently encoded sets of 
oUgonudeotide identifier tags assodated with eadi oligomer sequence. After 
identifying the oligomers widi hgand activity, the assodated oligonudeotides are 
amplified by PGR and sequenced to decode the identity of the oligomer. 



/. T inkin g the Trfpntifier Tag (g) tn thp OH^omer 

The identifier tags may be attached to chemically reactive groups 
(unmasked thiols or amines, for example) on the surface of a synthesis support 
functionalized to allow synthesis of an oUgomer and attadunent or synthesis of the 

1 5 oUgonudeotide identifier tag. The tags could also be attadied to monomers that are 

incorporated into a small proportion of the oUgomer diains; or as caps on a smaU 
number of the oligomer diains; or to readive sites on linkers joining the oligomer 

diains to the solid support. 

In one embodiment, the soUd supports will have chemically reactive 

2 0 groups that are proteded using two different or "orthogonal" types of protecting 

groups. The soHd supports will then be exposed to a first deprotection agent or 
activator, removing the first type of protecting group from, for example, the 
diemicaUy readive groups that serve as oUgomer synthesis sites. After reaction with 
the first monomer, the soUd supports wiU then be exposed to a second activator whidi 

2 5 ' removes the second type of protecting group, exposing, for example, the diemically 

reactive groups that serve as identifier tag attadunent sites. One or both of the 
activators may be in a solution that is contaded with the supports. 

In another embodiment, the linker joining the oUgomer and the solid 
support may have diemicaUy reactive groups proteded by the second type of 

3 0 proteding group. After readion with the first monomer, the soUd support bearing the 

Unker and the "growing" oUgomer wiU be exposed to a second activator whidi 
removes the second type of protecting group exposing the site that attadies the 
identifier tag directly to the Unker, rather than attadunent directly to the soUd 
support. 

3 5 When activators or deprotection agents are incorporated into the 
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method of preparing a synthetic peptide library having a pixiraiity of different 
members, each member comprising a solid support attached to a different single 
peptide sequence and an oligonucleotide identifier tag identifying said peptide 
sequence, the method comprises: a) apportioning the solid supports among a 
'5 plurality of reaction vessels; b) reacting the solid supports with a solution in each 
reaction vessel and treating sequentially with (1) a first activator to remove a first type 
of protective group fi^om the solid support, (2) a first amino add or peptide to couple 
said amino add or peptide to said solid support at sites where said first type of 
protective group has been removed; (3) a second activator to remove a second type of 

1 0 protective group from the solid support; and (4) a first nudeotide or oiigonudeotide 

tag to couple said tag at sites where said second type of protective group has been 
removed; c) pooling the solid supports; d) apportioning the pooled solid supports 
among a plurality of reaction vessels; and e) repeating step (b) to couple a second 
amino add or peptide and a second nudeotide or oiigonudeotide tag to said solid 
1 5 support. 

As noted above, the invention can also be carried out in a mode in 
which there is no solid support, and the tag is attached directly (or through a linker) to 
the oligomer being synthesized. The size and composition of the library will be 
determined by the nimiber of coupling steps and the monomers used during the 
ZO sjmthesis. Those of skill in the art recognize that either the tag or the monomer may 
be coupled first, in either embodiment 

Another possible embodiment is the use of two solid supports, such as 
beads, that are physically linked together, one with synthesis sites 
(or linkers) for the ohgomer and one with attachment sites (or linkers) for the 

2 5 identifier tags. This arrangement allows the segregation of oligomers and identifier 

tags into discrete "zones" and permits the use of widely different chemically reactive 
groups and chemistries for attachment. The solid supports can be derivatized 
separately and then linked under conditions where all or nearly all of the synthesis 
solid supports will have a tag-attachment solid support in tow. The solid supports can 

3 0 be of different sizes, as for example a large synthesis bead with several (or many) 

smaller tag-attachment beads linked. In one embodiment, the first solid support will 
have at least one attached amino add and the second solid support will have at least 
one attached nudeotide. 

The mode of linking the two beads is constrained by the chemistry of 
3 5 oligomer synthesis. The most obvious means of linking the beads is with a 
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heterobifunctional cross-linking agent (for examples of such agents, see Pierce 

T n,nTprhnnlog^^ r.f.lnff .nd Handbook pp. E10-E18 (1991)) interacting with the 

dominant chemically reactive groups on each spedes of solid support. 

5 VI. T7nrnHiri| r the Identifier Tag Ln formation 

The choice of bases used in an oUgonudeotide identifier tag is dictated by 
the chemistry of oligomer synthesis. For example, the use of strong add to deprotect 
peptides would depurinate nudeic adds. Therefore, when standard diemistries for 
peptide synthesis are employed, the pyrimidines C and T could be used in a binary 

1 0 code- Thus, in a preferred embodiment, the identifier tag wiU be an oligopyrimidine 

sequence. 

In another embodiment, the lability of purine nudeotides to strong add 
may be overcome through the use of the purine nudeoside analogs, such as 7-deaza- 
Z-deoxyadenosine and 7-deaza-2'-deoxyguanosine (see Barr et al-. BioTechniques 
1 5 4:428-432 (1986), and Sdieit, Murleotide A T ialop-s: Svnthpsis and Biological Function 
pp. 64-65 (John Wiley and Sons, New York), both of which are herein incorporated by 
reference). Use of these or other analogs would permit the use of a quaternary or 
other, as opposed to a binary, encoding scheme. 

Information retrieval from oUgonudeotide identifier tags is possible 

2 0 through various encryption schemes, two of which are described below. In tiie first, 

the ohgomer sequence information is at least in part encoded in the lengtii of tiie 
oUgonudeotide. Each different monomer added at a given step in the oUgomer 
synthesis may be represented by an oUgonudeotide tag of unique length. The 
oUgonudeotide inherentiy contains amplification sites, such as PGR priming 

2 5 sequences, characteristic of the given step-number in the oligomer syntiiesis. 

Determination of the oUgomer composition at any given position in the sequence 
tiien involves ampUfying the tag using the PGR priming sequence characteristic for 
tiiat step in the synthesis and size-separating the ampUfication products utiUzing 
techniques weU known in tiie art, such as gel or capiUary electrophoresis (using the 

3 0 tagging oUgonudeotides as standards) This embodiment is particularly useful when 

one desires to make a Ubrary of compounds related to a lead sequence. One need only 
tag during steps in which a site being analoged is synthesized. 

&i addition to length, oUgomer sequence information can also be 
encoded in the sequence of bases comprising the oUgonudeotide tag. This type of 
3 5 encryption is of value not only in the embodiment in which one attaches a different 
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oligonucleotide tag at each coupling step but also in the embodiment in which one 
extends an oligonucleotide tag at each coupling step. For example, as shown in Fig. 4, 
one may use oligonucleotides of up to about 100 bases (or somewhat longer), each 
having seven regions, as described below. 
5 Region 1 is a 3'-PCR primer site (20 to 25 bases). This site is used in 

conjunction witii another PGR site (at the 5'-end of the oUgonudeotide) to prime 
amplification by PGR. Other amplification methods may also be used. 

Region 2 is a "step-specific" DNA sequencing primer site (15-20 bases). 
This site is specific for the particular numbered step in the synthesis series* All the 

1 0 oligonucleotides added to all the beads at a particular step will have this sequence in 

common. Each numbered step wUl have a highly specific primer site representing 
that step. 

Region 3 is a spacer (20-30 bases). A spacer segment of variable length, 
but preferably 20 to 30 bases long, places tiie coding site sufficientiy distant from the 
1 5 sequencing primer site to give a good "read" through the monomer encoding or 
identification region. 

Region 4 is a monomer identification region (8 bases). Each base in this 
string represents one bit of binary code, where, for example, T = 0 and C = 1. Each set 
of step-specific identifier tags consists of 8 bases with a 1 (C) or a O (T) at each of the 8 

2 0 positions. These may be thought of as switches set to "on" or "off" at the different 

positions. Each monomer type is encoded by a mixture of 1 to 8 of these "switches." 

Region 5 is a step number confirmation region (4 bases plus 2 bases on 
either side for region distinction). Four bits in this short stretch encode the step 
mmiber. This is redundant to the sequencing primer but can be used to confirm that 

2 5 the proper primers were used and that the right step is decoded. 

Region 6 is a repeat of the monomer identification region (8 bases). This 
region has the same information as region 4, and is used to confirm monomer 
identity. Installing this second monomer encoding region also increases the 
probability that a good sequencing "read" will be obtained. 

3 0 Region 7 is a 5 -PGR primer site (20 to 25 bases). This site serves as a site 

for annealing the second PGR primer for amplification of the sequence. The length of 
oligonucleotides with all seven of these features, some of which are optional, will 
commonly be between 75 and 125 bases. 

An 8 bit format can encode 256 different monomer types. The niraiber of 
3 5 steps that can be enraded is determined by the nimU^er of step-specific sets (8 per set) of 
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oUgonudeofides on hand. With 10 sets (80 oUgos) one can encode up to 256 different 
monomers assembled into oUgomers up to 10 units long (thus providing encoding 
capabiUty for up to 25610 = i2x 1024 oligomer sequences). The coded identifier tags 
may be used so that each monomer is assigned a specific binary number (e.g. Ala = 
5 00000001, Gly = OOOOOIIO, etc). The appropriate oligonucleotides are combined to give 
the correct binary code. 

VH. T^pcnvering and DecodiTifr the Iden tifier Tag Information 

When specific beads are isolated in a receptor screening experiment, the 
1 0 beads can be segregated individuaUy by a number of means, including: infinite 

dflution, micromanipulation, or preferably, fluorescence activated cell sorting (FACS), 
although, with respect to the present invention, FACS is more accurately 
"fluoresance activated oUgomer or solid support sorting" (see Methods in Cell 
Biology, Vol. 33 (Darzynkiewicz, Z. and Crissman, ILA., eds.. Academic Press); and 

1 5 Dangl and Herzenberg, J Trnmunol. Methods 52:1-14 (1982), both incorporated herein 

by reference). Once the desired beads have been isolated, one needs to identify the tag 
to ascertain the sequence of the oligomer on the bead. 

To facilitate tag identification, one has a variety of options. For instance, 
one could read the tag directiy from the bead by sequencing or hybridization, if the tag 
20 is an oKgonudeotide. One can also amplify oligonudeotide tags to f adUtate tag 

identification. The oUgonudeotide identifier tags carried by a single soUd support or 
oUgomer can be amplified in vivo, by doning, or in vjftQ, e.g., by PCR. If the limit of 
detection is on the order of 100 molecules, then, at least 100 or more copies of each 
oUgonudeotide tag on a bead would be required. Copies of ±e tag are produced, 

2 5 eitiia: as single stranded oligonudeotides, double-sti-anded nudeic adds, or mbctures 

of single and double-stranded nuddc adds, by any of a variety of metiaods, several of 
which are described below, and the amplified material is sequenced. In tiie 
embodiment of the invention in which a separate and distind oUgonudeotide tag is 
added at each monomer addition step (as opposed to extending an existing tag at each 

3 0 step), one can ampUfy aU tags at once and tiien divide tiie amplified material into as 

many separate sequencing reactions as there were oUgomer S3mthesis steps 
(employing a different sequencing primer for each type of tag). In tiiis embodiment, 
one could also design the tags so ti\at each tag could be amplified separatdy from the 
other tags by appropriate choice of primer sequences- The sequencing reactions are 
3 5 perft>rmed and run on a standard sequencing gel, and the oUgomer sequence is 
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deduced from the code revealed in the resulting sequence information. 

An alternative strategy is to use corrunon PCR primers and common 
sequencing primers (the sequencing primer may even overlap completely or partially 
with a PCR primer site) and identify the step by hybridization to oligonucleotide 
5 probes that are complementary to each step-specific sequence in the oligonucleotides 
from the bead. A single set of sequencing reactions is performed on all of the 
amplified oligonucleotides from a single bead, and the reaction products are run in-a 
single set of lanes on a gel. The reaction products are then transferred to a suitable 
hybridization membrane and hybridized to a single step-specific probe (see Maniatis et 
1 0 al.. Cold Spring Harbor Laboratory, Cold Spring Harbor, NY (1982), v^hich is 

incorporated herein by reference). After detection of the resulting signal, the probe is 
washed from the membrane and another step-specific probe is hybridized. One could 
also use the procedure described in EPO publication No. 237,362 and PCT publication 
No. 89/11548, each of which is incorporated herein by reference. 

1 5 Parallel hybridization provides an alternative to sequential 

hybridization. The sequencing reactions are divided into a nimiber of aliquots equal 
to the number of peptide synthesis steps and nm in a separate set of lanes for each on 
the sequencing gel. After transfer of the reaction products to a suitable membrane, the 
membrane is cut to separate the sets of lanes. Each lane set is then hybridized to one 

2 0 of a plurality of step-spedfic oligonucleotide probes (see 'TJniplex DNA sequencing" 

and "Multiplex DNA sequencing," in Plex Luminescent Kits Product Cataloe. Bedford, 
MA, 1990, incorporated herein by reference). 

As noted above, a single synthesis solid support (or an attached bead 
bearing a tag, or in solution in a "well") may only comprise a few hundred copies of 

2 5 each oligonucleotide tag. These tags may be amplified, e.g., by PCR or other means 

well known to those skilled in the art, to provide sufficient DNA to be sequenced 
accurately. The ability to decode the oligomers depends on the number of available 
oligonucleotide identifier tags, the level of amplification that can be achieved from 
the available tags, and the accuracy of sequencing that amplified DNA. 

3 0 The most commonly used in vitro DNA amplification method is PCR. 

Alternate amplification methods include, for example, nucleic add sequence-based 
amplification (Compton, Nature 350:91-92 (1991), incorporated herein by reference) 
and amplified antisense RNA (Van Gelder et al., Proc. Nat Acad. Sci. USA 85:7652- 
7656 (1988), incorporated herein by reference), and the self-sustained sequence 
3 5 repUcation system (3SR, see GuatelU et alv Proc. Nad. Acad. Sci. USA 87: 1874-1878 
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(1990), incorporated herein by reference). 

If PGR amplification of an oUgonudeotide identifier tag is employed, one 
may encounter ■'PCR product contamination," caused by the product of one PGR 
reaction contaminating a subsequent PCR reaction mixture designed to amplify other 
5 tags having the same PGR primer binding sites. One may prevent this problem by 
introducing labiUty into the product sequences and treating subsequent reactions so as 
•to destroy potential contamination carried over from previous reactions. A specific 
example of this strategy, for which commercial kits are sold by PEO and laf e 
Technologies, is to introduce dUMP into die product Treating each new PCR reaction 
10 with uracil-N-glycosidase degrades any dU-containingDNA present, preventing 

amplification of the contaminant. The template DNA, which contains no dU (only 
dT) is not affected. Of course, the glycosidase is removed or inactivated before 

amplification is begun. 

Some of the tags described above for peptide syntiiesis have the unusual 

1 5 characteristic of containing only pyrimidines. This means that the uracil glycosidase 
strategy (Perkin Ehner Cehis Instruments (PEQ) Catalog, Alameda (1991), 
incorporated herein by reference) will work on only half of the stirands produced- 
those containing Ts (or IPs). One cannot introduce dUMP into the complementary, 
purine-only stirand; however, the purine strand is highly vulnerable to aad 

20 depurination and alkaline-mediated scission of tiie backbone. The combination of 
these treatinents can greatiy reduce problems with product contamination. Another 
approach to preventing carryover contamination involves incorporation of a 
restriction site (Earl could be used for polypyrimidine tags) into the oUgonudeotide 
tag and digestion with the corresponding restiiction enzyme prior to amphfication of 

2 5 ' a reaction suspected of being contaminated with the tag. This method only works if 

tiie tag to be amplified will not be deaved by the enzyme, as would generally be the 
case for a single stranded oligonucleotide tag. 

For sequendng amplified DNA, one usually desires to generate smgle 
sti^ed templates. This generation may be accomplished by any of several means. 

3 0 One sudi means is asvmmetric PCR, where an excess of one of the primers is used to 

amplify one strand to a level 10 to 100-fold hi^er than the other (see, for example, 
U S. Patent No. 5,066^84, incorporated herein by reference). Anodier means of 
providing a single sti^ed template is to by biotinylate one of the primers and purify 
or remove the resulting strand by adsorption to immobilized streptavidin (EiSS 
35 T nfprhnnlQgy ^-^'"f '^^^ Handbook. 1991, incorporated herein by reference). 
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Yet another means involves generation of RNA transcripts (representing only one of 
the strands) from an RNA polymerase promoter and sequencing the transcripts with 
reverse transcriptase (Sommer et al., Chapter 25, In PCR Protoco ls: A Guide to 
Methods and Applications, supra, incorporated herein by reference). If the tags are 
5 composed of only pyrimidine nucleotides, then all purine strands can be eliminated 
by add /base treatment, leaving the pyrimidine strand for sequencing. 

The use of separate sequencing primers for each step-specific 
oligonucleotide requires a separate, conventional sequencing reaction for each step- 
specific primer. Using primers that are differentially labeled would allow the 
1 0 identifier tags from a single solid support to be sequenced in a single reaction and run 
in a single lane set (2 lanes) on a gel. There are now commercially available primers 
labeled with distinguishable fluorophores that are suitable for this purpose (ABI 
Catalog, incorporated herein by reference). Sets of chemiluminescent labels now 
distributed commercially may also be used (Bronstein et al„ BioTechniques 8: 310-314 

1 5 (1990), incorporated herein by reference). 

DNA sequencing enzjones which may be employed in the present 
invention indude Tag DNA polymerase, E. coli DNA polymerase I (or the Klenow 
fragment), T7 polymerase, SequenaseTM and Sequenase IlTM (Modified T7 DNA 
polymerases), Bst DNA polymerase, and reverse transcriptase (from AMV, MMLV, 

2 0 RSV, etc., see USB Enzvmes for DNA Sequendne, U.S. Biochemical Corp, 1991, 

Cleveland OH, incorporated herein by reference). 

The sequence of an oiigonudeotide tag may also be identified by a high 
fidelity DNA hybridization technique. To this end, very large scale immobilized 
polymer synthesis with oligonudeotides may be useful (see PCT patent publication 

2 5 Nos. 92/10587 and 92/10588, each of which is incorporated herein by reference). 

Vm. Screening Receptors with Synthetic Oligomer Libraries 

The tagged synthetic oligomer libraries of the present invention will 
have a wide variety uses. By way of example, these libraries can be used in 

3 0 determining peptide and nudeic add sequences that bind to proteins, finding 

sequence-specific binding drugs, identifying epitopes recognized by antibodies, and 
evaluating a variety of drugs for dinical and diagnostic applications, as well as 
combinations of the above. Peptides as short as about five amino adds long might be 
useful in receptor-binding studies, for example. 
3 5 Synthetic oligomers displayed on small beads can be screened for the 
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abiUty to bind to a receptor. The receptor may be contacted with the hbrary of 
synthetic oHgomers, forming a bound member between an receptor and the oUgomer 
able to bind the receptor. The bound member may then be identified. As one 
example, the receptor may be an immunoglobulin. 
5 The techniques for selection of individual beads displaying hgands on 

their surface are analogous to FACS methods for doning mammalian cells expressmg 
cell surface antigens or receptors. Therefore, methods for selecting and sortog beads 
will be readily apparent to those skilled in the art of cell sorting. For example, a 
receptor can be labelled with a fluorescent tag and then incubated with the mixture of 

1 0 beads displaying oligomers. After washing away unbound or non-spedfically bound 
receptors, one can then use FACS to sort the beads and to identify and isolate 
physically individual beads showing high fluorescence. 

Alternatively, affinity adsorption techniques can be employed m 
conjuction with the Ubraries of the invention. The mixture of beads can be exposed to 

1 5 a surface on which a receptor has been immobilized (see PCI patent pubhcation No. 
91/07087, incorporated herein by reference). After washing to remove unbound beads, 
one can then elute beads bound to the surface using conditions that reduce the avidity 
of the oUgomer/receptor interaction Oow pH, for example). The process of aftmity 
adsorption can be repeated with the eluted beads, if desirable. Finally, individual 

20 beads are physically separated, for example, by limited dilution, by FACS, or by 

methods similar to titose in which cells are incubated with a receptor coupled to small 
superparamagnetic beads and then cells expressing a ligand for the receptor ^e 
extracted using a high power magnet (see Miltenyi et al. Cytometery 11.231-238 1990), 
incorporated herein by reference). MagneticaEy selected ceUs can be further analyzed 

2 5 and sorted using FACS. Radionudeotides may also serve to label a receptor. 

Alternatively, the present invention can be used to generate kbranes of 
soluble tagged oHgomers, whidi can be used with a variety of screening methods. For 
instance, the oUgomer Ubrary can be syntiiesized on beads with an identifying tag 
encoding the oligomer sequence, live microscopic beads are placed in mdividual 

3 0 compartments or wells that have been "nanofabricated" in a silicon or other suitable 

surface- The oligomers are deaved from tiie beads and remain contamed within tiie 
compartment along with tiie bead and the attadied identifier tag(s). In one 
embodiment, the bottom surface is coated witii the receptor, and after the addition of 
binding buffer and a known Ugand for that receptor tiiat is fluorescentiy labdled, one 
3 5 effectively has a solution phase competition assay for novel Ugands for the receptor. 
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The binding of the fltiorescently labelled ligand to the receptor is estimated by confocal 
imaging of the monolayer of immobilized receptor. Wells with decreased 
fluor^cence on the receptor surface indicate that the released oligomer competes with 
the labelled ligand. The beads or the tag in wells showing competition are recovered, 
5 and the oligonucleotide tag is amplified and sequenced to reveal the sequence of the 
oligomer. 

The beads are loaded in the wells by dispersing them in a volume of ^ 
loading buffer sufficient to produce an average of one bead per well. In one 
embodiment, the solution of beads is placed in a reservoir above the wells, and the 
1 0 beads are allowed to settle into the wells, Qeavage of the oligomers from the beads 
may be accomplished using chemical or thermal systems, but a photodeavable system 
is preferred. 

Recovery of identifier-tagged beads from positive wells may be 
effectuated by a micromanipulator plucking out individual beads. However, a 

1 5 preferred mode involves the use of beads that have been previously labelled with a 

fluorescent tag. A laser of the appropriate wavelength is then used to bleach the 
resident beads in only the positive wells. All the beads are then removed en masse 
and sorted by FACS to identify the bleached positives. The associated tags may then be 
amplified and decoded. 
20 In a variation of this assay, the oligomer and tag may be synthesized 

attached to a common linker, which, in turn, is bound to the solid support. After 
placing the beads in the wells, one can cleave the linker from the bead, producing a 
tagged oligomer in solution. An immobilized receptor, such as a receptor bound to a 
bead or a receptor immobilized on one surface of the well, can be screened in a 

2 5 competition assay with the oligomer and a fluorescently labeled ligand. Instead of 

recovering the beads, one may recover the beads bearing immobilized receptors and 
sort tiie beads using FACS to identify positives (diminished fluorescence caused by the 
library oligomer competing with the labeled ligand) or one can determine the 
fluorescence enutting from the well surface coated with receptor. The associated 

3 0 identifier tag may then be amplified and decoded. 

In a third variation of this approach, soluble tagged oligomers, produced 
either by cleavage of the linked oligomer and tag firom the solid support as described 
above, or synthesized by theVLSIPS™ method described above, or synthesized in 
solution vwthout a solid support, are incubated with an immobilized receptor. After a 
3 5 wash step, the boimd, tagged oligomers are released from the receptor by, e.g., add 



wo 93/06121 



PCT/US92/07815 



32 



treatment. The tags of the bound oligomers are amplified and decoded. 

IX. Att Automatpri Tnstrumer <- far Oli^ompr Synthesis and Tagging 

The coupling steps for some of the monomer sets (amino acids, for 
5 example) require a lengthy incubation time, and a system for performing many 
monomer additions in parallel is desirable This can be accomplished with an 
automated instrument able to perform 50 to 100 paraUel reactions (channels). Such an 
instrument is capable of distributing the reaction mbcture or slurry of synthesis soUd 
supports, under programmable control, to the various channels for pooling, mixing, 

1 0 and redistributioiu 

Much of the plumbing typical of peptide synthesizers is required, with a 
large number of reservoirs for the diversity of monomers and the number of tags (up 
to 80 for a 10 step synthesis, in one embodiment) employed. The tag dispensing 
capabiKty will translate simple instructions into the proper mixture of tags and 

15 dispense that mixture Monomer building blocks will also be dispensed, as desired, as 
specified mixtures. Reaction agitation, temperature and time control may be 
provided. An appropriately designed instrument may also serve as a multi-channel 
peptide synthesizer capable of producing 1 to 50 mgs (crude) of up to 100 specific 
peptides for assay purposes. See PCT patent pubUcation 91/17823, incorporated herein 

20 by reference. 

EXAMPLE L SYNTHESIS ON GLASS BEADS OF 4 FLUORESCENTLY TAGGED 

PENTAPEFITDES 

A. DerivatiT^afinn of Glass Beads 

2 5 About 05 g of 3-10 urn diameter silica beads (Polysdence) were washed 

with refluxing 10% aqueous HNO3 for 20 min. The beads were pelleted and washed 
with distiHed water (5x) and methanol (3x) and dried at 125 degrees C for 12 hours. 
Beads were vortexed with a 5% solution of aminopropyitriethoxysilane in acetone for 
10 hours, pelleted and then washed with acetone (2x), ethanol (5x), and methylene 

3 0 chloride (2x) and dried at 125 degrees C far 45 min. Beads were suspended in dry DMF 

a mL) containing diisopropylethylamine (17 ^1, 100 nmoles) and a solution of Fmoc- 
b-alanine, pentafluorophenyl ester (200 mg, 420 pnoles. Peninsula Labs) in distilled 
water (15 mL) was added. After vortex treatment for 11 hours, the beads were 
pelleted and washed with DMF (3x) and methylene chloride (2x). Beads were treated 
3 5 with a 10% solution of acetic anhydride in DMF containing 0.05 mol of 4- 
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dimethylaminopyridihe to cap any underivatized aminopropyi groups, and then 
washed with DMF (2x) and methylene chloride (2x). Beads were vortexed with a 20% 
solution of piperidine in DMF and the release of the Fmoc-piperidine adduct 
quantitated by monitoring the absorbance spectrum of the supernatant at 302 nm 
5 (^302 = 7800 M- 1 cm- 1). An estimate of the degree of substitution of 10 jimoles of 
amino groups/g beads was thus obtained. Finally, the beads were washed with 
ethanol (5x) and methylene chloride (2x) and then dried at 85 degrees C for 12 hours. 

B. Preparation of Boc-Gly-L-Phe-L-Leu-OH 

1 0 Glycyl-L-Phenylalanyl-L-leucine (552 mg, 1.5 mmol, Bachem) was 

dissolved in a solution containing distilled water (10 mL) and 1 M NaOH (1.5 mL). 
The solution was cooled in an ice bath and was treated with a solution of di-tert-butyl 
pyrocarbonate (337 mg, 1.5 mmoi) in p-dioxane (12 mL). A white precipitate rapidly 
formed but redissolved after stirring_ at room temperature for 4 hours. The solution 

1 5 was concentrated to dryness in vacuo, the residue taken up in water (5 mL), and the 
pH adjusted to 2.5 by the addition of 1 M KHSO4. The aqueous suspension was 
extracted with EtOAc (2x, 15 mL), the organic layer separated, and dried over MgS04. 
After removal of the solvent in vacuo, the residue was triturated with hexane to 
afford Boc-Gly-L-Phe-L-Leu-OH as a white solid (yield=642 mg, 98%). 

20 

C Preparation of Gly-L-Phe-L-Leu Beads 

Boc-Gly-L-Phe-L-Leu-OH (44 mg, 0.1 mmol), benzotriazol-1- 
yloxytris(dimethyiamino)phosphoniixm hexafluorophosphate (44 mg, 0.1 mmol) and 
l-hydroxybenzotriazole hydrate (14 mg, 0.104 mmol) were dissolved in dry DMF (1 

25 mL). Diisopropylethylamine (20 pi/ 0.115 mmol) was then added and 0.65 mL of this 
solution was immediately transferred to a miaocentrifuge tube containing 80 mg of 
amino derivatized glass beads. The sealed tube was vortexed for 3.5 hours, and the 
beads were then pelleted and washed with DMF (3x) and methylene chloride (2x). 
The beads were then deprotected with a 50% solution of trifluoroacetic acid in 

3 0 methylene chloride for 30 min., washed with metiiylene chloride (2x), ethanol (2x), 
and methylene chloride (2x), and dried at 55 degrees C for 1 hour. 



35 



D. Preparation of Gly-Gly-L-Phe-L-Leu Beads (SEP ID NO:10) 

Fmoc-glydne pentafluorophenyl ester (46 mg, 0.1 mmol) was dissolved 
in dry DMF (1 mL) containing diisopropylethylamine (17 ^1, 0.1 mmol). About 0,65 
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mL of this solution was added to 20 mg of Gly-L-Phe-L-Leu beads in a microcentrifuge 
tube, and the tube was vortexed for 3 hours. The beads were peUeted and washed 
with DMF (4x) and methylene chloride (2x). Deprotection was effected by treatment 
with a 20% solution of piperidine in DMF for 30 min. Hie beads were washed with 
5 DMF (2x), ethanol (2x), and methylene chloride (2x) and dried at 60 degrees C for 4 
hours. 

E. Prpp;,ration of T -Prrvmv-T-Phe.T.-T.eu Beads (SFP TP NQill) 

Fmoc-L-proline pentafluorophenyl ester (50 mg, 0.1 mmol) was 
1 0 dissolved in dry DMF (1 mL) containing diisopropylethylamine (17 jil, 0.1 mmol). 
About 0.65 mL of this solution was added to 20 mg of Gly-L-Phe-L-Leu beads in a 
microcentrifuge tube, and the tube was vortexed for 3 hours. The beads were pelleted 
and washed with DMF (4x) and methylene chloride (2x). Deprotection was effected by 
treatment with a 20% solution of piperidine in DMF for 30 min. The beads were 
1 5 washed with DMF (2x), ethanol (2x), and methylene chloride (2x) and dried at 60 
degrees C for 4 hours. 

F. fliinrPficein Sf aminff nf Glv-Tilv-T.-Phe-L-Uu BeadS 

About 5.4 mg of Gly-ay-L-Phe-L-Leu beads were suspended in 450 fd of 
20 aqueous borate buffer (pH 8.5) and 54 ^1 of a 10 nM solution of fluorescein 

isothiocyanate (FTTC) added. After vortex treatment for 1.5 hours, the beads were 
washed with buffer (5x), ethanol (2x), and methylene chloride (2x). FACS analysis 
indicated that approximately 10% of available amino groups had been titrated with 
-FETC. 

25 

G. (-n-rnn plin^ pfT -Tyrosine an H Biotm to Mixti.rP nf T.-Pro-GIv-T.-Phe-L-Leu and 
FITClab&nffl GIv-Glv- T-pTiP-T.-T^u Beads 

5 mg of FTTC labelled Gly-Gly-L-Phe-L-Uu beads and 5 mg L-Pro-GIy-L- 
Phe-L-Leu beads were mixed together in a single tube, vortexed with a 0.1 mM 

3 0 solution of diisopropylethylamine in methylene chloride, and the suspension was 
divided into two equal portions. Hie beads were peUeted, and to one portion was 
added a solution containing Fmoc-O-tert-butyl-L-tyrosine pentafluorophenyl ester (59 
mg, 95 \uriol), N-hydroxysucdnimidobiotin (1.7 mg, 5 ^mol) and 
diisopropylethylamine (17 ^1, 100 junol) in dry DMF (1 mL). After vortexing for 3 

3 5 hours the beads were washed with distilled water (2x), ethanol (2x), metiiylene 
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chloride (2x) and DMF (Ix). Fmoc deprotection was effected by treatment with a 20% 
solution of piperidine in DMF for 30 min., and tert-butyl side chain protecting groups 
were removed by treatment with 25% trifluoroacetic acid in methylene chloride for 30 
min. The pelleted beads were washed with methylene chloride (2x), ethanol {2x), and 
5 TBS (be). 

H. R-Phycoervthrin Staining of Biotinvlated L-Tyr-(Glv/L-ProVGlv-L-Phe-L-Leu 
Beads (Mixture of SEP IP NQ:12 and SEP ID NO:13) 

Biotinylated L-tyrosine beads from (G) above were suspended in TBS (0.5 

1 0 mL) and treated with 10 jil of R-phycoerythrin-avidin conjugate (Molecular Probes) 

for 30 min. Pelleted beads were washed with TBS (5x). 

L Co-coupling of L-Proline and Biotin to Mixture of L-Pro-GlvL-Phe- 

T..Leu and FTTC labelled Glv-Glv-L>Phe-L-Leu Beads (Mixture of SEP IP NP:15 
15 and SEOIPNP:14) 

5 mg of a mixture of L-Pro-Gly-L-Phe-L-Leu and FITC labelled Gly-Gly- 
L-Phe-L-Leu beads were treated with a solution containing Fmoc-L-proline 
pentafluorophenyl ester (48 mg, 95 ^mol), N-hydroxysucdnimidobiotin (1.7 mg, 5 
jimol), and diisopropylethylamine (17 pJ, 100 ^unol) in dry DMF (1 mL). After 

2 0 vortex treatment for 3 hours, the beads were washed with DMF (2x), ethanol (2x), 

methylene chloride (2x), and DMF (Ix). Fmoc deprotection was effected by treatment 
with a 20% solution of piperidine in DMF for 30 min,, and by way of control, the 
beads were treated with 25% trifluoroacetic acid in methylene chloride for 30 min. 
The pelleted beads were washed with methylene chloride (2x), ethanol (2x), and TBS 
25 (be). 

J, Tri-Color Staining of Biotinylated L-Pro-fGlv/L-ProVGlv-L-Phe'L' 
Leu Beads 

Biotinylated L-proline beads from (i) above are suspended in TBS (0.5 
30 mL) and treated with 20 \d Tri-Color streptavidin conjugate (Caltag Labs) for 30 min. 
Pelleted-beads are washed with TBS (5x)- 

K. Selection of Beads Containing Peptide Ligands for Monoclonal Antibody 3E7 
Monoclonal antibody 3E7 was raised against the opioid peptide beta- 

3 5 endorphin- The binding specificity of MAb 3E7 has been well characterized by 
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solution assays with chemically synthesized peptides. The equilibrium binding 
constants (Kd) of the peptides considered here are as foUows: YGGFL is 6.6 nM; and 
YPGFL, PPGFL, and PGGFL are each >1 mM; thus, only the peptide YGGFL shows 
appreciable afBnity for the antibody. 
5 A mixture of beads containing either YGGFL, YPGFL, PGGFL, or PPGFL 

and their respective tags (see above) are added in phosphate buffered saline (PBS) 
containing monodonal antibody 3E7 that has been previously conjugated to colloidal 
superparamagnetic microbeads (Miltenyi Biotec, West Germany). After a 16 hr 
incubation at 4 degrees C, beads which bind the 3E7 antibody are selected using a high 
10 strength magnet. The selected beads are then analyzed by flow cytometry. Analysis of 
the selected beads reveals that they contain both fluorescein and R-phycoerythrin, 
indicating that only beads displaying the peptide YGGFL are selected by the 3E7 
antibody. 

15 EXAMPLE 2: SYNTHESIS ON GLASS BEADS OF 4 PENTAPEPTTDES TAGGED WITH 

OUGONUCLEOTIDE IDENTIFIERS 

A. "iiynfhPsis of Tdpntifier QUgnniirlentides m-(W) 

The oHgonudeotide identifier tags (I)-(IV) have tiie sequences shown 

below. The regions complementary to the 5' and 3' PGR primers are underlined. The 
2 0 regions complementary to the step-specific sequencing primers are shown in lower 

case: there are two steps in this example. The monomer encoding region is shown in 

bold type: Cly encodes Gly, TCT6 encodes L-Pro, and TTCTs encodes L-Tyr in tiiis 

case. Thus oligos (I)-(IV) code respectively for Gly in position 2, L-Pro in position 2, L- 
^ Tyr in position 1, and L-Pro in position 1. 
25 (J) y-RlT^-rTTTr rTCCrCTCCC T(-TTTTCirCTC^^^ i m iCTC 

C ' i ' lCri 1 1 1 i ICT CTCCCICTCrCCTCrCrCccctttctctcctttc 

r tcrrcTccrrTrTcrcTrcTn'CC- s' (SEQIdno:1) 

(H) q'-BlH24nrTTCr TCCTCTCCCTCTTrTCTCCTO 1 1 1 iCTC 
30 CmCl 1 A 1 1 IC TCTCCCTCTCTCCTCrCrCccctttctctcctttc 

r trrTTTrcrcTrTCiOTcnTCC- s' (SEQiDNoa) 

(m) y-BlB2-rTTTr TTCCTCTCCCTCnTrrCT^ i i i iCTC 

CTXTTCITriTCTCTCCCICICTCCTCrCTCtcttcctttcccctrt 
35 ^/^r-rr-rrrTrTCrCTCTrcnTCC- 3' (SEQIDN0:3) 
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(IV) S'.RlR2-rTTTC TTCCTCTCCCTC:i 1 L 1 CTCCTCr 1 CI 1" 1 ! 1 ICTC 
Cni ' Cl ' l 1 1 1 IC TCTCCCTCTCTCCTCTCTCtcttcctttcccctct 
rfrtr frrTrTrrTCTCTCTCTrCTTTCC- S' (SEQIDNO:4) 
5 where: Bl =p-Maleimid0-C6H4-(CH2)3-C(O)NH-{CH2)6-O-PO2-O-, and B2 = CH2- 
CH[(CH2)4-NH-Biotm]-CH2-0-P02-0-. 

Oligos (I)-(IV) are synthesized on an ABI PCR-mate synthesizer using 
commercially available (Sigma) DMT-O-Me phosphoramidites. The N4-amino group 
of cytidine is protected as the benzoyl derivative. The 5' termiiial (Bl) and 
1 0 penultimate (B2) phosphoramidites are respectively N-MMT-Ce-AminoModifer 
(Qonetech) and Biotin Phosphoramidite (Glen Research) for each oligonucleotide. 
The fully protected O-metiiyl phosphotriester oligomers are cleaved from the CPG 
support by treatment v»^ith concentrated NH4OH at 25 degrees C for 1 hour. The aude 
products are purified by affinity chromatography on a morusmeric avidin-agarose 

1 5 column (Pierce), and the full-length material is eluted with 2 mM biotin. The 5'- 

MMT group is removed by treatment with 80% acetic add for 1 hour at 25 degrees C, 
and the solution is evaporated to dryness. The products are dissolved in PBS, pH 8.0, 
and treated with a 50-fold excess of sucdnimidyl 4-(p-maieimidophenyl) butyrate 
(Pierce) in DMF for 30 min. The modified, protected oligonucleotides are desalted by 

2 0 RP-HPLC, lyophilized and stored under nitrogen. 

The primers used for PGR and sequencing are prepared in the normal 
fashion and are shown below: 

5' PGR Primer 5'-TGGTCTGCGTGTnTGTGGTGT-3' (corresponds to 

bases 7-28 of SEQ ID NO:l) 
25 3' PGR Primer 5'-Biotin-GGAAAGAAGAGAGAGAGGAGAGG-3' (SEQ 

IDN05) 

Step #1 Sequencing Primer 5'-AGAGAGGGGAAAGGAAGA-3' (SEQ ID 

N0:6) 

Step #2 Sequencing Primer 5'-AGGAAAGGAGAGAAAGGG-3' (SEQ ID 

30 N0:7) 

B. Prpparation of Glv-Glv-L-Phe-L-Leu Beads Beari ng Identifier Oligo (Tl 

5 mg of Qy-L-Phe-L-Leu beads are treated with a solution containing 
Fmoc-Gly-OH (99.95 ^mol), Fmoc-Cys(Npys)-OH (0.05 ^mol, Calbiochem), 
35 benzotriazol-l-yloxytris-(dimethylamino)phosphonium hexafluorophosphate (100 
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10 



umol), l-hydroxybenzotriazole hydrate (100 jimol), and diisopropyiethylamine (150 
^ol)indryDMF(lmL)£cr2hours. The beads are washed with DMF (2x) and then 
with methanol (2x) and then treated with a 10 mM DTT solution in methanol for 30 
min. to deprotect the cysteine residues. The beads are quickly washed with ice^id 
methanol (2x), peEeted, and then reacted for 20 min with 100 pi of a ^-l -M sol^n 
of oUgo (D in methanol. After washing with methanol (2x) and then with DMF (2x), 
the beads are deprotected for 20 min. with20% piperidine in DMF. Finally the beads 
are washed with DMF (2x), methanol (2x), and then methylene chloride (2x) and dned 
at 45 degrees C for 1 hour. 

C p.o p.r.Hnn nfL- P r -r.i y-T -PhP-T.-LPn BphHs Rparing Identifipr 0]igp OD 

5 mg of Gly-L-Phe-L-Leu beads are treated as in (b) above, substitutmg 
Fmoc-L-Pro^H and OUgo (H) for Fmoc-Gly-OH and OUgo a), respectively. 

15 D. P^opn -^'"" nf(nm- y-r yrJnw/J..Vrn)^v-JrVhe4.-U^X^^ds Bearing . 
T^^T^tifipr oligQs cm and I/m 

Beads from (b) and (c) are pooled and divided into two equal portions. 
One portion is treated as in (b), substituting Fmoc(OtBu)-L-Tyr-OH and OUgo (HI) as 
appropriate. 

E p.o p.r.Hnn nf T.-P- ^-fr-i y /T -Prn^■^,lv-T -PhP-T -T,Pii BeadS BParinff Trfentifier 

The second pool is treated as before, substitating Fmoc-L-Pro-OH and 
OUgo (IV) as appropriate. 



20 



25 

F Pprrmsritutio P and Deproter Hnri nf the Peptide Library 

Beads from (d) and (e) are pooled, and the phosphate, amino add side- 
chain, and nucleotide exocydic amino protecting groups are removed as foUows. A 
one hour treatment with a 1:2:2 mixture of thiophenok triethylamine: p-dioxane is 
30 foUowed by washing the beads with methanol (2x) and then methylene daoride(2x), 

and then the beads are treated for 5 min. with 95:5 trifluoroacetic add: ethanedithiol. 
After a wash with methanol (3x), the beads are treated at 55 degrees C with 1:1 
ethylenediamine: ethanol for 1 hour and then washed first with ethanol (2x) and 
then with PBS (2x). This coUection of beads constihites the Ubrary and contains 
35 approximately equal quantities of the 4 immobiUzed peptides YGGFL, YPGFL, PGGFL 
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and PPGFL. Additionally, each bead carries two distinct 113 bp oligonucleotide 
sequences encoding the identities of both the first and second amino acids of the 
peptide on that bead, 

5 G. PGR Amplification of Oligonucleotide Identifier Tag 

After a FAC sort of affinity purified beads into individual 0.5 mL 
polypropylene tubes, 25 fii of TBS containing 0.1 M-g sahnon sperm DNA (as carrier) 
are added together with 25 ^1 of 2X PGR Amplification Buffer (FECI) to each tube. The 
2X buffer contains: 100 mM KCl; 20 mM Tris-Cl, pH 8,4, 20 degrees C; 6 mM MgCl2; 
1 0 0.4 mM dNTP's; 1 of 5* PCR primer; 1 |iM of 3* PGR primer; and 100 tmits/ml Taq 
DNA polymerase. 

After buffer addition, the sample is covered with 50 \il of mineral oil and 
transferred to an automated thermal cycler. In the thermal cycler, the samples are 
heat denatured at 95 degrees G for 2 min., and then cycled 35 times through 3 steps: 95 

1 5 degrees G/30 sec, 60 degrees G/1 miru, 72 degrees C/1 min., which steps are followed 

by an incubation at 72 degrees G for an additional 5 min. and then the tubes are cooled 
and held at 15 degrees G until ready for processing on streptavidin beads. The 
mixture is heated to 95 degrees G to denature the strands, and the biotinyiated purine 
strand and excess 3' PGR primer are removed by addition of streptavidin-coated beads. 
20 The tubes are centrifuged at 12K rpm for 5 min. The supernatant is used in the 
sequencing reactions, as described below, 

H. Sequencing of PGR Amplified Oligonucleotide Tags 

The amplified oligonucleotides from individual bead isolates are 

2 5 sequenced in a pair of reactions (using ddA or ddG as chain terminators) with either 

the Step #l-specific or the Step #2-specific sequencing primers. 

To anneal the template and primer, for each set of two sequencing lanes, 
a single annealing and subsequent labeling reaction is run by combining 8.5 ^l of 
sequencing primer (conc= 0.25 pmol/^1), 1.5 ^il Sequenase^" 5X sequencing buffer 

3 0 (200 mM Tris HQ, pH 7,5; 100 mM MgQz; and 250 mM NaQ), and 10 ^1 of template 

DNA from the ampUfication supernatant above. The samples are heated for 2 
minutes at 65 degrees G and allowed to cool slowly to room temperature (approx. 10 
minutes). 

The labeling reaction is performed as follows. Sequenase^*^ (v2.0) is 
3 5 diluted 1 : 20 with TE (10 mM Tris HQ, pH 7.5; and 1 mM EDTA), and a labeling 
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cocktaU containing a 2 : 3.5 ratio of diluted enzyme to labeling mix (i.e., a 4 : 2 : 1 
mixture of 150 nM dGTP, 0.1 M dithiothreitol, aipha-35s^IP, >1000 Q/mmol) is 
prepared. About 5.5 fil of the cocktail are incubated with 10 nl of annealed 
template/primer (from (i)) at 25 degrees C for 5 min. 
5 ihe termination reactions are performed as follows. 6Mloflabelmg 

reaction mixture are added to 5 ^1 of each of the appropriate ddXTP termination 
reaction mixes (ie., 80 mM dGTP, 80 dAIP, 50 mM NaCl and 8 yM ddGTP or 8 
MM ddATP). After incubation at 37 degrees C for 5 min., about 8 fil of Stop Solution 
{95% formamide, 20 mM EDTA, 0.05% bromophenol blue, and 0.05% xylene cyanol) 

1 0 are added to each of the termination reactions. 

The sequencing gel is comprised of 6% total acryiamide (19:1 
acrylamide/bis), 0.09 M Tris base, 0.09 M boric add, 1 mM EDTA, and 7 M urea. The 
eel is polymerized by addition of 1.9 fil of 25% ammonium persulfate per mL and 0.72 
ll of lEMED per mL of above gel solution. The gel is allowed to polymerize at least 

1 5 one hour and is pr^run at least 20 minutes prior to sample loading. Gel plates are 
then maintained between 40 and 50 degrees C prior to and during the run. 

Reactions are heated to 85-95 degrees C for 2 minutes prior to loading, 
and the gel is run until the bromophenol blue dye reaches the bottom of the gel. The 
sequences of interest run between the bromophenol and xylene cyanol marlcers. TbB 

20 information required to identify the sequence of monomers in the oligomers attached 
to the bead is contained in the DNA sequence information. 

EXAMPLES: PARALLEL SYNTHESIS OF PEPTIDES AND OUGONUCLEOTIDE 

TAGS ON CARBOXYL BEADS 

25 A c; ynthesis of Phnsphoramidites ffl-(IV) 

The 3'-(allyl N,Ndiisopropyl-phosphoramidites) of S'-DMI derivatives 
of: (1) N6-(aUyIoxy)carbonyl-7-deaza-2'-deoxyadenosine; (2) NMaUyloxy)carbonyl-2'- 
deoxy-cytidine; (3) N2.(anyloxy)carbonyl-7-deaza-2'-deoxyguanosine; and (4) 
thymidine (see Figure 6), are prepared according to the procedures of Hayakawa et al., 

3 0 T A»,o. rhpm.Soc.112: 1691-1696 (1990), incorporated herein by reference). 

B. nor TTrattTin^ ParhnYvl Beaf ^« With a Diamine Linker 

Preparation of a bifunctional bead material for paraUel synthesis of 
peptides and oUgonudeotides is iUustrated in Hgure 7. Three 50 mg aUquots of 45 
35 urn diameter polystyrene/polydivinylbenzene/polymethyhnethacrylate/COOH beads 
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(Bangs* Laboratories) were each placed in a separate miaocentrifuge tube and treated 
as follows. First, the beads were treated with 0.1 N aqueous HCL (3 mL) and stirred by 
vortexing for 15 minutes. The beads were then pelleted with a microcentrifuge, the 
Uquid supernatant decanted, and the remaining bead pellet successively washed (by 
5 vortexing, pelleting, and decanting: a process referred to as "washed") with water (3 x 
1 mL) and dimethylformamide (DMF, 3x1 mL). 

The compounds 2-(lH-benzotriazol-l-yl)-l,l,3,3-tetramethyluronium 
hexafluorophosphate (HBTU, 38 mg, 0.10 mmol), 1-hydroxybenzotriazole (HOBT, 
15mg, 0.10 mmol), and DMF (0.5 mL) or dichloromethane (0.5 mL) were added to the 
1 0 bead pellet. Diisopropylethylamine (DIEA, 54 ^1, 030 mmol) was added, and the 

suspension was vortexed for 1 min. The compound 4,9-dioxa-l,12-dodecanediamine 
(20 \iX, 0.10 mmol) was then added, and the reaction was vortexed for 30 minutes. The 
reaction was then diluted with DMF (1 mL), the beads were pelleted, and the 
supernatant decanted. The pellet was treated with 9:1 DMF/water (1.0 mL) and 

1 5 vortexed for 15 minutes. The beads were then pelleted, the supernatant decanted, and 

the beads washed with DMF (3 x 1.0 mL). 

C Attaching Fpptide and Oligonucle otide Synthesis Linkers 

100 mg of the beads are treated with a mixture of 4-Fmoc-aininobutyric 

2 0 acid (0.1 mmol) and 4-p,p'-dimethoxytrityl (DMT)-hydroxybutyric acid (0.1 Mmol) in 

the presence of HBTU (0.1 mmol), HOBt (0.1 mmol), and DIEA (0.1 mmol) in 9:1 
CH2Cl2:DMF (1.0 mL). After vortex treatment for 30 minutes, the reaction mixture is 
diluted with DMF (1.0 mL), die beads pelleted, and the supernatant decanted. The 
beads are washed with DMF (3 x 1.0 mL). The coupling procedure is then repeated 

2 5 with fresh reagents, and the beads are pelleted and washed as described above. 

D. Buildin g a 3* VCR Priming Site on th e Hydroxy Linkers 

The parallel assembly of oligonudeotide-tagged peptides on beads is 
illustrated in Figure 8. A PGR priming site of 20-25 nucleotides is assembled as 

3 0 follows. Note that all reagents used are anhydrous, and reactions occur under an 

atmosphere of dry argon. About 10 mg of the beads are svibjected to an eight-step 
reaction sequence to couple a protected phosphoramidite. The reaction steps are: (1) 
beads are washed for 0.5 minutes with acetonitrile (MeCN); (2) DMT groups are 
removed with 3% trichloroacetic acid in CH2a2 for 1.5 minute; (3) beads are washed 
3 5 with MeCN for 3 minutes; (4) beads are treated with 0.1 M phosphoramidite (L H, m. 
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or IV) in MeCN containing either 0.5 M (4-nitrophenyl) tetrazole or 0.5 M pyridinium 
hydrochloride and 1.0 M imidazole for 2 min.; (5) beads are washed with MeCN for 
03 minutes; (6) beads are capped with a mixture of Ac20/2,6-lutidine/THF (1:1:8) 
containing 5% DMAP; (7) beads are oxidized with 1 M tBuOH in CHiQi for 0.8 
5 minutes; and (8) beads are washed for 0.5 minutes with MeCN. Steps one through 
eight are repeated from one to 25 times to assemble a PCR priming site of up to 25 
nucleotides. 

E. rmiplinff of First Amin o Acid to Amino Linkers 

^ 0 Peptide and nucleotide couplings may be alternated, as illustrated in 

Figure 8. To couple an amino add (or peptide), the Fmoc group is first removed from 
the beads by treatment witii 30% piperidine in DMF for 60 min. The beads are washed 
3 times with DMF. The beads are then ireaXed with a solution containing the 
appropriate amino add (0.1 M), HBTU (0.1 M), HOBt (0.1 M), and DIEA (0.1 M) in 9:1 

1 5 CH2a2:DMF for 30 min. The coupling is tiien repeated with fresh reagents for a 
further 30 min. and the beads are washed witia DMF (3x) and then with MeCN (3x). 

F. Constructinn of First Oligonug leottde "Codon" 

A "codon" of about 3 to 5 nucleotides uniquely representii^ the identity 
20 of the first amino add is then built at the 5' end of the oUgonucleotide drain using 
the 8-step coupling cyde in procedure (d) above. 

G. roiipling- of <^nhsequent A mino Adds anrf "Codon" Cnnsfaruction 

The metiaods of procedures (e) and (f) are then repeated using tiae 

2 5 appropriate amino add and nudeotide building blocks until tiie desired peptide and 

the o%onudeotide coding region are completely assembled. 

H. rnnstniction nf a 5' PCR Priming Site 

The 8-step couplir^ cycle of procedure (d) is used to build a 20-25 

3 0 nudeotide PCR priming site on tiie 5' terminus of tiie oUgonudeotide chains. 

L ne protection nf the Oligonucleotid e and Peptide Chains 

The fully assembled peptide and oligonudeotide chains are deprotected 
as foUows. The amino-terminal Fmoc groups are removed by ti^atment witii 30% 
3 5 piperidine in DMF and tiien a wash with TBiF (3x). To remove tiie allyHc protecting 
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groups, the beads are treated witti a THF solution containing 
tris(dibenzylideneacetone) dipalladium-chloroform complex (0.02 M), 
triphenylphosphine (0.2 M), and 1:1 n-butyiamine/formic add (1.2 M) at SCC for 30 
min. and the pelleted beads are washed with THF. The beads are washed with 0.1 M 
5 aqueous sodium N,N-diethyldithiocarbamate and then water to remove traces of 
palladium. The amino acid protecting groups are then removed by treatment with 
95:5 TFA/water for 30 min. "Scavenger" reagents such as 1,2-ethanedithiol aiui 
tiiioanisole may also be included in this acidic deprotection medivim (e.g., 2% of each 
by volume). Finally, the fully deprotected beads are washed with aqueous buftier and 
10 are ready for interaction with a biological receptor. 

EXAMPLE 4: LIBRARY PREPARATION AND SCREENING 

In this example, two populations of amine derivatized beads were 
labeled with oligonucleotides possessing base sequences uniquely characteristic of 

1 5 each bead population. The population labeled with an oligonucleotde 95 bases in 

length (95 mer) was subsequently coupled to the peptide YGGFL. The population of 
beads labeled with an oUgonudeotide 110 bases in length (110 mer) was coupled to 
phenyalanine (F). The beads were then mixed in the ratio of twenty F/110 mer 
beads for each YGGFL/95 mer bead and stained with a fluorescently labeled antibody 

2 0 3E7 that binds the peptide YGGFL with high affinity. Individual fluorescently 

stained beads could then be sorted by FACS directly into PGR hibes. After PGR, 5 of 6 
fluoresently stained beads gave rise to a fragment of amplified DNA 95 bp long. PGR 
of the remaining single bead gave rise to small DNA fragments, possibly being 
primer dimer. 

2 5 The oligonudeotides used in this experiment are the two tags, two PGR 

primers, and one sequencing primer. The same PGR and sequencing primers were 
used for the two tags. The two tags differ in their sequence and length. Both tags 
were composed of the bases 7-deazaA, C, aiul T. 
The 95 mer tag has the sequence: 

30 rr A CTC ACT AGC ACT GTA CTA TAA GG A GGG GTT GCT ATT GCA AAA TTA 
GAA Act tat etc aac tac ate tCA CAC TGA C TT ATC TCT AGA TGT AC (SEQ ID 
N0:8) 

The 110 mer tag has the sequence: 

rr A CTC ACT Arc act cta cta taa gc g tgg gct att gga aaa tta gat 

35 GCT ATT CCA A ^ A r a a Arf ta» rtr aar taf ate tCA CAC TCA CTC ATC TGT 



wo 93/06121 



PCr/US92/07815 



44 



ArATCTAC (SEQ ID NO:9) 

For each target the underlined sequences represent PGR primer binding sites: the 
sense primer is at the 5'-end, and the anti-sense primer is at the 3'-end. Also, for 
each target the smaU case sequence represents the sequencing primer binding site. 

5 

A. Bpari Pre paration 

Beads were purchased from Bang's Laboratories (979 Keystone Way, 
Carmel, IN 46032) and are composed of carboxylated polystyrene (4.5 Mm average 
diameti). These beads were subjected to diamine derivitization by the process 

1 0 described below. 

Beads (200 mg) were treated with 1.0 mL of 1 N HCl and vortexed 15 
min The beads were peUeted, decanted, and washed with three times (3x) with 1.0 
mL of water each wash and then washed 3x with 1.0 mL of DMF each wash. To the 
washed peUet was added 2-(lH-benzotriazol-l-yl)-l,U>tetramethyluronium 

1 5 hexafluorophosphate (HBTU; 38 mg, 0.1 mmole), l-hydroxybenzotriazole hydrate 
(HOST; 15 mg, 0.1 mmole), 500 nL of methylene chloride, and 54 pJ- of 
diisopropylefhylamine (DIEA; 0.3 mmole). After vortex treatment for 2 min., 20 jiL 
of diamine (4,9-dioxa-l,12-dodecanediamine; 94 nmole) were added. After vortex 
treatment for 30 min, 1.0 mL of DMF was added, and the beads were peUeted by 

20 centrifugation. The supernatant was removed, and 1-0 mL of 10% water in DMF 
was added. The beads were vortexed an additional 15 min. and finally washed 3x 
with 1.0 mL of DMF each wash. 

B. niigQnude "^'^p Attachment 

2 5 Two diferent target oUgonudeotides were employed in this 

experiment: a 95 mer and a 110 mer. These oUgonudeotides were composed of the 
bases cytidine, thymidine, and Z-deaza adenosine. The oUgonudeotides were 
synthesized with a primary amino group on the 5'-terminus (5'-amino modifier C- 
12, aen Researdi). LyophiUzed oUgonudeotide (600 pmole) was dissolved in 5 jiL 
30 of 05 M Na-phosphate, pH7.7, and Ae solution was treated with 10 pL of 0.2 M 
disucdnimydylsuberate (DSS). The reaction proceeded 10 min., and then 85 nL of 
ice-cold water were added. Unreacted DSS was removed by centrifugation. The 
supernatant was passed through a G25 spin column that had been equiUbrated with 
water. The eluant was immediately frozen and lyophiUzed to isolate the 5'-N- 

3 5 hydroxysucdnamide ester of the oUgonudeotide. 
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This activated oligonucleotide was dissolved in 50 M-L of 0.1 M Na- 
phosphate, pH 7.5, which contained 0.1 mg/mL of sonicated salmon sperm DNA. 
This solution was added to 10 mg of diamine derivitized beads. After vortex 
treatement for 3 hr., the beads were washed 2x with 0.4 mL of 0.1 M Na-phosphate, 
5 pH 7.5, each wash, and then washed 2x with 0.4 mL of 0.1 N NaOH. Finally, the 
beads were washed with 3x with 0.4 mL of pH 7.5 buffer. 

C PppriHP Attachment 

To Boc-YGGFL or Boc-Phe (Hoc = t-butoxy-carbonyl amine protecting 
1 0 group; 0.1 mmole) was added HBTU (0.1 mmole), HOBT (0.1 mmole), 1.0 mL of 10% 
DMF in methylene chloride, and DIEA (0.3 mmole). After vortex tireattnent to 
dissolve the solids, 0.4 mL of tiie peptide solution was added to 3 mg of 
oUgonucleotide-labeled beads. The solution containing Boc-YGGFL was added to 
beads labeled with the 95 mer, and ttie solution containing Boc-Phe was added to 

1 5 beads labeled with tiie 110 mer. The reaction mixtures were vortexed 30 min. and 

tiien diluted witti DMF, centrifuged, decanted, and the bead pellets washed witii 3x 
witii 1.0 mL of THF. The Boc protecting groups were removed by treating the beads 
with 0.4 mL of 95% trifluoroacetic acid for 10 miiu The deprotection reaction was 
then dUuted with THF, centrifuged, and decanted, and tiie beads were washed witii 
20 3x witii 1.0 mL of DMF each wash. FinaUy, the beads were washed witii 3x witii 0.5 
mL of 0.1 M Na-phosphate, pH 7.5, each wash and stored as a slurry (10 mg/mL). 

D. M^^^g- Stai pinfr. and Sorting 

The beads coupled witii tiie 95 mer and YGGFL were mixed witii tiie 

2 5 beads tiiat were coupled to tiie 110 mer and F in tiie ratio of 1:20. Thus, 0.1 mg of 95 

mer/YGGFL beads (2 million beads) were nuxed witii 2.0 mg of the 110 mer/Phe 
beads (40 miUion beads). The mixhire was suspended in blocking buffer (PBS, 1% 
BSA, and 0.05% Tween-20) and incubated at room temperahure for 1 hr. The beads 
were next pelleted by cenhifugation and resuspended in a solution of an FITC- 

3 0 labeled monoclonal antibody 3E7 tiiat binds tiie peptide YGGFL (1 ng/mL). The 

suspension was incubated 0.5 hr on ice and tiien centiifuged to isolate tiie bead 
pellet. 

The beads were resuspended in PBS for delivery into tiie fluorescence 
activated cell sorting (FACS) instrument (Becton Dickinson FACSORT Plus). Beads 
35 ttiat had bound to the fluorescentiy labeled antibody were identified by their 
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acquired fluorescence, and fluorescent beads were isolated by sorting into PGR tubes. 
One, ten, or one hundred fluorescent beads were sorted into each PGR tube. In an 
analogous manner, non-fluorescent beads were also sorted into PGR tubes. 

5 E. Am plificati "" qnrted Beads 

To each PGR tube containing a bead or beads was added 25 \sL of PGR 
buffer (20 mM Tris-HQ, pH 8.7; 10 mM KCL; 10 mM (NH4)2S04; 2 n.M MgCl^ 0.1% 
Triton X-100; 0.14 mg/mL BSA; 200 pm dATP; 200 nm dGIP; 200 ]mt dCIP; 200 jim 
dTTP; 2 m primer #1; 2 jim primer #2; and 0.5 units of Pfu DNA polymerase). 
1 0 Reactions were subjected to 40 cycles of 95°C for 0.5 min., 55°G for 1 min., and VT'C 

for 1 min. ^ , i 

Gel loading dye (2 \jL) was added to 10 ]iL of each PGR, and the sample 
was run on a 2% low melting point agarase gel. DNA fragments were visualized by 
staining with ethidium bromide and exposure to UV Ught. Five of six of the tubes 
1 5 containing single flourescent beads gave rise to DNA .ftagments 95 base pairs m 
length, confirming that these beads were coupled to YGGFL and not F. Tubes 
containing 10 or 100 fluorescent beads also gave rise to 95 mer DNA fragments. 
Gonversly, none of the tubes containing 1, 10, or 100 non-fluorescent beads gave rise 
to 95 mer ficagments. 

20 There were, however, anomalous amplification products smaller than 

110 bp from amplification of the tags of non-fluorescent beads. These anomalous 
products may have arisen through the use of unprotected oUgonucleotide tags in 
this example, which may have allowed the free exocydic amines to couple to the F 
- amino add, thereby rendering ti»e tag subject to anomalous amplification. This 

25 problem would not have affected the 95 mer tag to the same extent, because YGGFL 
would be less reactive with the exocydic amines than F. 

EXAMPLE 5: UBRARY SYNTHESIS AND SGREENING 

This example is illustrated schematically in Figure 9. Briefly, a single 

3 0 population of amine derivatized beads (prepared as described in Example 4) was 

coupled to glycine. The population was then divided into two equal parts, and each 
part was labeled witti a characteristic oUgonudeotide that would uniquely identify 
the bead subpopulation. The subpopulation that had been labeled with an 
oUgonudeotde 95 bases in length (the 95 mer described in Example 4) was 

35 subsequenfly coupled to the peptide YGOT:. The population of beads that had been 
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labeled with an oUgonucleotide 110 bases in length (the 110 mer described in 
Example 4) was coupled to the peptide FLFLF. (SEQ ID NO:16) The beads were then 
nuxed in the ratio of twenty FLFLF/110 mer beads for each YGGFL/95 mer bead (i.e., 
20:1) and stained with a fluorescently labeled antibody (3E7) that binds the peptide 
5 sequence YGGFL with high affinity. Individual fluorescently stained beads and 
unstained beads were sorted directly into PGR hibes. Upon PGR, all the fluoresentiy 
stained beads gave rise to a fragment of amplified DNA 95 base pairs in length, and 
all the unstained beads gave rise to a fragment 110 base pairs in length. 



10 A. Ppptide Coupling Step #1 

To Fmoc-Gly (Fmoc = 9-fluorenylmethoxycarbonyl amine protecting 
group; 0.1 mmole) was added HBTU (0.1 nunole), HOBT (0.1 mmole), 1.0 mL of 10% 
DMF in methylene chloride, and DIEA (0.3 mmole). After vortex treatment to 
dissolve the solids, 0.4 mL of the solution containing the activated amino acid was 

1 5 added to 50 mg of diamine derivatized beads. The reaction mixture was vortexed 30 

min. and ttien dUuted with DMF, centiifuged, decanted, and tiie bead peUet washed 
twice with 1.0 mL of DMF. The coupling reaction was tiien repeated. The beads 
were then treated witii 1.0 mL of 30% piperidine in DMF witii vortexing for 1 hr. to 
deprotect the glycine amino group. 

20 

B. Olignnucleotide Labeling 

Two different target oligonucleotides were employed in this 
experiment: the 95 mer and 110 mer described in Example 4. Half of the bead 
sample described above (25 mg) was labeled witii the 95 mer, and Hie other half was 

2 5 labeled witii tiie 110 mer. These oUgonudeotides are composed of 2'-deoxy-cytidine, 

tiiymidine, and 2'-deoxy-7-deaza-adenosine. The oUgonudeotides were syntiiesized 
witii a primary amino group on the 5-terminus (MMT-C12-Aminomodifier, 
Qonetech Laboratories, Inc). Lyophilized oUgonudeotide (15 nmole) was dissolved 
in 10 fiL of 0.5 M Na-phosphate, pH 7.7, and the solution was tiien treated witii 20 
30 jiL of 0.2 M disucdnimydylsuberate (DSS). The reaction proceeded 10 min., and 
then, 70 ^L of ice-cold water were added. Unreacted DSS was removed by 
centrifugation. The supernatant was passed tiurough a G-25 spm column tiiat had 
been equiUbrated witii water. The eluant was immediately frozen and lyophilized 
to isolate tiie S'-N-hydroxysucdnamide ester of the oUgonudeotide. This activated 

3 5 oUgonudeotide was dissolved in 100 \lL of 0.1 M Na-phosphate, pH 7.5, which 
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ccmtained 0.1 mg/mL of sonicated salmon sperm DNA. This solution was added to 
25 mg of glydneKioupkd beads. After vortex treatment for 3 hr., the beads were 
washed twice with 0.4 mL of 0.1 M Na-phosphate, pH 7.5, and twice with 0.4 mL of 
0.1 N NaOH. FinaUy, the beads were washed three times with 0.4 mL of pH 7.5 
5 buffer. 



C Pppride Cou pHnpr qfep «2 

To Boc-YGGFL or Boc-FLFLF (Boc = t-butoxy-carbonyl amine protecting 
group- 0.02 mmole) was added HBTU (0.02 mmole), HOST (0.02 mmole), 0.190 mL 

1 0 of 10% DMF in methylene chloride, and DIEA (0.06 mmole). After vortextreatment 
to dissolve the solids, the solution was diluted ten-fold in 10% DMF in methylene 
chloride. An aUquot of this solution (0.345 mL) was added to the glydne-coupled 
and oligonudeotide-labded beads (25 mg). The solution containing Boc-YGGFL was 
added to beads labeled wi& the 95 mer, and tiie solution containing Boc-FLFLF was 

1 5 added to beads labeled with the 110 mer. The reaction mixtures were vortexed 30 
min. and then diluted with DMF, centrifuged, decanted, and the bead peUets washed 
three times with 1.0 mL of THF. The Boc protecting groups were removed by 
treating the beads with 0.4 mL of 95% trifluoroacetic add for 10 min. The 
deprotection reaction was then diluted witii THF, centrifuged, decanted, and the 

20 beads washed three times witti 1.0 mL of DMF. FinaUy, tiie beads were washed tiiree 
times with 0.5 mL of 0.1 M Na-phosphate, pH 7.5, and stored as a slurry (10 mg/mL). 

D. Mixing. Staining, and Sorting 

The beads coupled to the 110 mer and FLFLF were mbced with tiie 

25 beads that were coupled to the 95 mer and YGGFL in tiie ratio of 20:1. Thus,0.1mg 
of 95 mer/ YGGFL beads (2 million beads) were mixed with 2.0 mg of the 110 
mer/FLFLF beads (40 milUon beads). The mbctiire was suspended in bloddng buffer 
(PBS, 1% BSA, 0.05% Tween-20) and incubated at room temperature for 1 hr. The 
beads were next pdleted by centrifugation and resuspended in a solution of an FTTC- 

3 0 labeled monodonal antibody (3E7) that recognizes the peptide sequence YGGFL (1 
Hg/mL). The suspension was incubated 05 hr. on ice and then centrifuged to isolate 
the bead peUet 

The beads were resuspended in PBS for delivery into tine fluorescence 
activated ceU sorting (FACS) instrument (Becton Dickinson FACSORT Plus). Beads 
35 that had bound to the fluorescentiy labeled antibody were identified by tiieir 
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acquired fluorescence (see Figure 10), and homogeneous samples of either 
fluorescent or non-fluorescent beads were isolated by sorting into PGR tubes. One, 
ten, or one hundred beads of each type were sorted into each PGR tube. 

5 E PTR nf Snrted Beads 

To each PGR tube containing a bead, or beads, was added 25 \iL of PGR 
mb< (20 mM Tris-HQ, pH 8.7, 10 mM KGL, 10 mM (NH4)2S04, 2 mM MgGl2, 0.1% 
Triton X-100, 0.14 mg/mL BSA, 200 |iM dATT, 200 ^iM dCTP, 200 dGTP, 200 \iM 
dTTP, 2 ^iM of each primer (as described in Example 4), and 0.5 units of Pfu DNA 

1 0 polymerase). Reactions were subjected to 40 cycles of 95''C for 0.5 min., 55°C for 1 
min., and 72°G for 1 min. Gel loading dye (2 ^iL) was added to 10 \lL of each PGR and 
the sample run on a 2% low melting point agarose gel. DNA fragments were 
visualized by staining with ethidium bromide and exposure to UV Ught Six single 
bead samples, three 10 bead samples, and three 100 bead samples were amplified 

1 5 from both the fluorescent and non-fluorescent populations. All the bead samples 
from the flourescent population produced only DNA fragments 95 base pairs in 
length, and all the samples from the non-fluorescent population produced only 
fragments 110 base pairs in length (see Figure 11). 
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SEQUENCE LISTING 

(1) GENERAL INFOEMATION: 

(L\ APPLICANT: DOWER r WILLIAM J 
^ ^ BABRETT, RONALD W 

GALLOP, MARK A 

NEEDELS, MICHAEL C 

fii^ TITLE OF INVENTION: METHOD OF SYNTHESIZING DIVERSE 
COLLECTIONS OF OLIGOMERS 

(iii) NUMBER OF SEQUENCES: 16 

fiv) CORRESPONDENCE ADDRESS: ,«,^«»Tf> 
Va) addressee: TOWNSEND AND TOWNSEND 

(B) STOEET: 1 MARKET PLAZA, STEUART TOWER, SUITE 2000 

(C) CITY: SAN FRANCISCO 

(D) STATE: CALIFORNIA 

(E) COUNTRY: USA 

(F) ZIP: 94105 

fv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy dxsk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS • ^ 

(D) SOFTWARE: Patentin Release #1.0, Version #1.25 

fvi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 



rviii^ ATTORNEY/AGENT INFORMATION: 
(A) NAME: smith, William M. 



(B) REGISTRATION NUMBER: 30,223 

(C) REFERENCE/DOCKET NUMBER: 11509-36-1 

fix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 415-543-9600 

(B) TELEFAX: 415-543-5043 
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(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 111 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
CTTTCTTCCT CTCCCTCTTT TCTCCTCTCT TTTTTTCTCC TTCTTTTTTT CTCTCCCTCT 60 
CTCCTCTCTC CCCTTTCTCT CCTTTCCTCC TCTCCTCTCT CTCTTCTTTC C 111 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 111 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
CTTTCTTCCT CTCCCTCTTT TCTCCTCTTC TTTTTTCTCC TTTCTTTTTT CTCTCCCTCT 60 
CTCCTCTCTC CCCTTTCTCT CCTTTCCTCC TCTCCTCTCT CTCTTCTTTC C 111 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
CTTTCTTCCT CTCCCTCTTT TCTCCTCTTT CTTTTTCTCC TTTTCTTTTT CTCTCCCTCT 
CTCCTCTCTC TCTTCCTTTC CCCTCTCTCT CTCCTCTCCT CTCTCTCtTC TTTCC 



60 
115 
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(2) INFORMZITION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHAEACTERISTICS : 

(A) LENGTH: 115 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECDLE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
CTTTCTTCCT CTCCCTCTTT TCTCCTCTTC TTTTTTCTCC TTTCTTTTTT CTCTCCCTCT 60 
CTCCTCTCTC TCTTCCTTTC CCCTCTCTCT CTCCTCTCCT CTCTCTCTTC TTTCC 115 



(2) INFORMATION FOR SEQ ID NO: 5: 

ri) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
GGAAAGAAGA GAGAGAGGAG AGG 



(2) INFORMATION FOR SEQ ID NO: 6 1 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pedxs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:6: 

18 

AGAGAGGGGA AAGGAAGA 



wo 93/06121 



PCT/US92/0781S 



53 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
AGGAAAGGAG AGAAAGGG 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 95 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
CCACTCACTA CCACTCTACT ATAACCACCC CTTCCTATTC CAAAATTACA AACTTATCTC 60 
AACTACATCT CACACTCACT CATCTCTACA TCTAC ^5 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 110 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:9: 
CCACTCACTA CCACTCTACT ATAACCCTCC CCTATTCCAA AATTACATCC TATTCCAAAA 60 
TTACAAACTT ATCTCAACTA CATCTCACAC TCACTCATCT CTACATCTAC 110 
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(2) lOTORM&TXON FOR SEQ ID MO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECDLE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Gly Gly Phe Leu 
1 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amincp acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

Cii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO til: 

Pro Gly Phe Leu 

1 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECDLE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Tyr Gly Gly Phe Leu 
1 ^ 
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(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Tyr Pro Gly Phe Leu 
1 5 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Pro Gly Gly Phe Leu 
1 5 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Pro Pro Gly Phe Leu 
1 5 
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(2) INFORMATION FOR SEQ ID NO: 16: 

ri) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Phe Leu Phe Leu Phe 

1 5 
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WE CLAIM: 

1. A synthetic oUgomer Ubrary comprising a plurality of different 
members, each member comprising an oligomer composed of a sequence of 
monomers linked to one or more identifier tags identifying the sequence of 
monomers in said oligomer. 

2. The Ubrary of daim 1, wherein said linkage between said oligomer 
and said identifier tag comprises a solid support. 

3. The library of claim 2, wherein said linkage between said oUgomer 
and said identifier tag comprises a linker between said identifier tag and said soUd 
support and a linker between said soUd support and said oUgomer. 

4. The Ubrary of daim 1, wherein said Unkage between said oligomer 
and said identifier tag comprises a Unker between said oUgomer and said identifier 
tag. 

5. The Ubrary of claim 2, wherein said Unkage between said oUgomer 
and said identifier tag comprises a Unker joining said oUgomer and said identifier tag 
to which Unker said soUd support is also joined. 

6. The Ubrary of claim 1, wherein said Unker comprises a first bead 
Unked to a second bead, wherein said oUgomer is attadied to the first bead and said 
identifier tag is attached to the second bead. 

7. The Ubrary of claim 1, wherein said identifier tag is attached to 
said oUgomer. 

8. The Ubrary of daim 1 that has about 106 different members. 

9. The Ubrary of daim 1, wherein said oUgomer is a peptide. 

10. The Ubrary of claim 1, wherein said identifier tag is a fluorescent 

marker. 
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11. Hie library of claim 1, wherein said identifier tag is an 
oligonucleotide. 

12. The library of daim 2, wherein: 

i) the Ubrary has greater than about 106 members; 

ii) the oligomer is a peptide; and 

iii) the identifier tag is an oUgonudeotide. 

13 A synthetic oligomer library produced by synthesizing on eadi of a 
pluraUty ofsolid supports a single oHgomer sequence, said oUgomer sequence bemg 
ifferentfor different solid supports, whidi library is synthesized in a process 
comprisingthe steps ^^^^^^^ ^^^^.^ , p,^aHty of reaction 

b) exposing said supports in eadi reaction vessel to a first 



vessels; 
monomer; 



vessels; 
monomer; and 
times. 



c) poolir^ said supports; 

d) apportioning said supports among a plurality of reaction 

e) exposing said supports in eadi reaction vessel to a second 

f) repeating steps a) through e) from at least one to twenty 



14 A tagged synthetic oUgomer library produced by syntiaesizing on 
eadi of a plurality of soUd supports a single oUgomer sequence and one or more 
identifier tags identifying said oUgomer sequence, said oUgomer sequence and 
identifier tags synthesized in a process comprising tiie steps of: 

a) apportioning said supports among a plurakty of readion 

vessels, exposing said supports in eadi reaction vessel to a first 

oUgomer monomer and to a first identifier tag; 

c) pooling said supports; 

d) apportioning said supports among a pluraUty of readion 

vessels; 
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e) exposing said supports to a second oligomer monomer and 
to a second identifier tag monomer; and 

f) repeating steps a) through e) from at least one to twenty 

times. 

5 

15. The library of claim 14, wherein said oligomer sequence is a 
peptide, and said identifier tag is an oligonucleotide. 

16. The library of claim 15, wherein said identifier tag comprises 
1 0 purine analog bases. 

17. The library of claim 14, wherein said oligomer is cleaved from the 
solid support after completion of oligomer synthesis. 

15 18. A method of preparing a tagged synthetic oligomer Ubrary 

comprising a piuraUty of different members, each member comprising a soUd support 
having attached a single oligomer sequence and one or more identifier tags 
identifying said oUgomer sequence, said method comprising the steps of: 

a) apportioning said supports among a plurality of reaction 

20 vessels; 

b) reacting said supports in each reaction vessel widi a first 

oligomer monomer; 

c) reacting said supports in each reaction vessel with a first 

identifier tag; 

25 d) pooling said supports; 

e) apportioning said pooled supports among a plurality of 

reaction vessels; 

f) reacting said pooled supports in each reaction vessel with a 

second oligomer monomer; 
30 g) reacting said pooled supports in each reaction vessel witii a 

second identifier tag; and 

h) repeating steps a) through g) from at least one to twenty 

times. 

35 19. The method of claim 18 wherein said solid supports in each 
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reaction, 
monomer 



vessel are reacted first with said identifier tag and then with said oUgomer 



20. A soUd support con^rising a first particle attached to a second 
particle, said first particle linked to an oUgomer and said second particle linked to an 
oUgonudeotide, and wherein said oUgomer is other than an oUgonudeotide. 

21. A method of recording each step in a sequence of oUgomer 
monomer additions in the synthesis of an oUgomer Ubrary, the method comprising 
adding an identifier tag in conjunction witii \he addition of eadi monomer, and 
performing at least two cydes of monomer and tag addition, thereby formmg a senes 
of identifier tags identifying said oUgomer sequence. 

22. The method of daim 21, wherein said identifier tag monomer is a 
fluorescent marker. 

23. The method of daim 21, wherein said identifier tag monomer is a 
nudeotide or oUgonudeotide. 

24. The metiiod of daim 23, wherein said oUgomer monomer is an 
amino add, peptide, or combination thereof. 

25. The method of daim 24, further comprising the steps of 

^ ampUfying said identifier tag sequence, thereby forming an ampUfied identifier tag; 
and sequencing said ampUfied identifier tag. 

26. Themethodof daim 21,wheiein after tiie addition of a first 
identifier tag, s^sequent identifier tags are attadied to the terminus of a preexisting 
tag. 

27. A method of synthesizing a peptide and an oUgonudeotide on a 

soUd support, said metiiod comprising: 

a) preparing a bifunctionalsoUd support containing a first type of 

adive site blodced with a first type of proteding group and a second type of adive site 
blocked with a second type of protecting group; 
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b) reacting said solid support with an activator to.remove said first 
type of protecting group thereby exposing said first type of active site; 

c) coupling an oligonucleotide monomer or an oligonucleotide to 

said first type of active site; 

d) reacting said solid support with an activator to remove said 
second type of protecting group thereby exposing said second type of active site; 

e) coupling a peptide monomer or peptide to said second type of 
active site; and 

f) repeating steps (b) through (e) from one to twenty times. 

28. The method of claim 27, wherein said solid support comprises 
polystyrene/polydivinylbenzene/polymethylmethacrylate CCXDH beads. 

29. The method of claim 27, wherein said Afunctional solid support 
contains an amine active site and a hydroxyl active site. 

30. The mettiod of claim 27, where said bifunctional solid support is 

produced by: 

(a) derivatizing polystyrene/polydivinylbenzene/ 
polymethyhnethaaylate COOH beads with a 4,9-dioxa-l,12-dodecanediamine linker; 

(b) treating said beads with 4-Fmoc-aminobutyric add and 4-p,p'- 
dimethoxytrityl-hydroxybutyric add in tiie presence of HBTU, HOBt and DIEA. 

31. The method of daim 27, wherein said oligonudeotide is 
assembled using 3'-{aUyl N,N'-diisopropyl-phosphoramidites) of 5'-DMT derivatives 
of: 

(a) N6-(allyloxy)carbonyl-7-deaza-2'deoxyadenosine; 

(b) N4-(allyloxy)carbonyl-2'deoxycytidine; 

(c) N2-(allyloxy)carbonyl-7-deaza-2'deoxyguanosine; and 

(d) thymidine. 



32. The method of daim 27, wherein said oligonudeotide is 
assembled using photochemically removable protecting groups on said nudeotides. 
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